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TO GERTRUDE 


PREFACE 


There are two major directions in the development of automatic control 
systems: 


|. Improvements in components 
2. Making the best use of the components 


The second aspect, which is generally known as system design, is the 
main topic of this book. 

Up until 10 years ago, most literature on control systems regarded com- 
ponents as no more than transfer functions. Admittedly, one of the most 
essential aspects of a component is its transfer function. Admittedly 
ilso, experienced system designers do not regard components as transfer 
functions alone. They use them merely for studying a closed-loop 
system’s stability and response to medium-sized signals, but in selecting 
‘omponents to be used, a fair amount of attention is also paid to the sys- 
lom’s power capacity, stability of its characteristics, internal noise, 
linearity, ete. While this double-pronged approach—linear stability 
(\heory plus common sense—has been highly successful and is still being 
ised by most servo designers, it is not adequate to cope with a complex 
system in which ultimate response characteristics are desired. To 
(dlosign such a system, experience is insufficient, and experiments are likely 
to be too slow. 

In the past decade, systematic treatments of the optimum-design 
problem, which considers one or the other limiting features of the so-called 
lixed (or unalterable) component in addition to its transfer function, 
hve appeared in the literature. In the author’s opinion, there are four 
husic methods in this category: 


|, Wiener’s least-square optimization with quadratic constraint and its 
inmnifestations in nonstationary and sampled-data systems 
2. ‘The maximum principle and its forerunner, the “bang-bang” servo 
principle 
4, Self-optimizing systems of various types 
4, Computer optimization and control of nonlinear systems 
vil 


viil PREFACE 


The present volume consists of a treatment of the above four basic 
methods plus two auxiliary topics: 
1. Estimation and measurement of power spectra and correlation 


functions 
2. An analysis of the changes in a closed-loop system’s response because 


of component inaccuracies 


The material selected is based on the author’s opinion of what addi- 
tional knowledge is most useful to a servo designer in his work, assuming 
that he has mastered the basic tools of the trade such as Laplace trans- 
forms, Bode plot, Nyquist plot, Evans’s root-locus method, ete. This 
additional knowledge is treated in the simplest possible way (known to 
the author), but the reader is assumed to be fairly well prepared at least 
in undergraduate mathematics. Knowledge of contour integration is 
essential and of probability theory and statistics desirable but not neces- 
sary, as the prerequisite knowledge on this subject is treated briefly in 
Appendixes A and C. Matrix algebra is used in some parts of the book, 
but a reader can understand most of the book without it. 

The book may be used as a graduate text by itself or as an auxiliary 
text in a graduate course based on one of the standard texts. It will also 
be useful as a reference book for servo designers and research engineers. 

It is the author’s pleasure to acknowledge the encouragement and good 
suggestions from Dean John R. Ragazzini of New York University, Dr. 
J. G. Truxal of Brooklyn Polytechnic Institute, Professor T. J. Higgins 
of the University of Wisconsin, and Dr. J. H. Chadwick of the Sperry 
Gyroscope Company. Many thanks are due to Miss Maryann Regan 
for typing most of the manuscript and also to Mrs. Mary Rooney, Mrs. 
Eleanor Gilmore, Mrs. Marie Trotta, and Mrs. Beatrice Schwartz for 
helping to make various revisions and corrections on the manuscript. 


Sheldon S. L. Chang 
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CHAPTER 1 


INTRODUCTION 


1-1. Some Remarks on the Technological Development of Feedback 
Control Systems. The modern theoretical development of feedback con- 
trol systems started in the 1920s or 1930s and is marked by Minorsky’s 
paper on the steering of ships (1922), Nyquist’s paper Saami 
Cheory (1932), and Hazen’s paper Theory of Servomechanisms (1934) 
Before that, the development of feedback controls was mainly in the 
hands of inventors. While there were isolated instances of successful 
applications of the concept of feedback control, such as Watt’s ie Sow 
(ions of the flyball governor to the steam engine (1788) Whitehead’s 
torpedo control (1866), and Sperry’s gyro stabilizer (1915) there were 
many, many more attempts that were left unrecorded anwea they failed 
I'he lack of theory prevented consistent success and economical desi 
toward a prespecified objective. hi 

I he theoretical developments made it possible for engineers to design 
watisfactory feedback control systems as daily routine, using such iat 
olassical methods as the Nyquist plot, Bode diagram, Nichols chart, and 
vans’s root-locus method. However, one common denominator of 
these design techniques is that they have been developed without critical 

consideration. Each method leads to one way or another of com es 
wating a system so that it is stable and satisfies a set of more nue 
arbitrary performance requirements, e.g., rise time, bandwidth, error 
coeflicients, peak overshoot, etc. As these reqnieeigaihe can be sniaked 
in many ways, the selection of system configuration as well as time con- 
wlants of the compensating networks is left largely to the discretion or 
experience of the designer. There is no place in the above-mentioned 
techniques for many factors which are known by experience to be sig- 
nificant, e.g., the torque-to-inertia ratio of a servomotor, the noise in je 
sensing elements, etc. Consequently, the question of weehi makes the 
heat system under actual operating environment and component limit 
(ions cannot be answered by these techniques alone. 4 
In the past one or two decades, while the classical design techniques 
have been reaching their fruition, the trend of research work has cade! 
toward optimization. This has been for good reason: The seein is no 


longer how to design one of many systems that work but to design the 
| 


a 


a 
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In many applications, notably fire-control and 


system that works best. 
uracy is too good; in missile- 


inertial-guidance systems, no degree of acc 
steering systems, no response is too fast. When we try to improve the 


accuracy and speed of response of a system, the ultimate limits, which 
are reached sooner or later, are noise and saturation. Any design tech- 
nique, if it is to be realistic, must take due account of these factors. The 
outstanding work in this direction is represented by Wiener’s theory of 
optimum filtering and prediction. However, as Wiener’s theory was not 
intended originally for control systems, it did not give due consideration 
to the power limitations of the components. This significant addition 
to the theory was due to Newton. Other developments of wide engineer- 
ing implications include the phase-plane technique or predictor control of 


Load disturbance D 


Controlled 


Reference input 
variable C 


and noise 


Controlled 
system 


System 
controller 


Measuring 
instrument 


Noise or instrument error N, 


Fic. 1-1. Basic components in a feedback control system. 


systems with simple saturation, the development of sampled-data sys- 
tems, and the recent developments in self-optimizing systems. The last 
is perhaps the culminating point of the trend toward optimization. An 
ideal self-optimizing system learns about its environment and adjusts 
itself to optimum expected performance in a continual process of measur- 
ing and adjusting. While developments in this area are still in their 
infancy, the importance of the self-optimizing concept cannot be over- 
emphasized. 

4-2. The Given Conditions in a Servo-design Problem. Perhaps the 
for the recent developments and trends in 
of the problems facing 
o define a number of 


compelling need or reason 
servo theory can be understood by an examination 
a servo designer today. In doing so, we shall als 
terms that will be useful later. 

A general representation of a contro 
controlled system may be a stabilized pl 


| system is shown in Fig. 1-1. The 
atform including a torque motor, 
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A Nic es including the control surface and its hydraulic drive, or 
ac ation of a servomotor, gears, and markin 
9 , gears, . Weassume that 
we may install instruments to measur i sen = like 
e whichever system variable we li 
» we like. 
However, seid relly the error or noise in measurement. The aie 
ence input may be transmitted from rem ied ; 
ce il ote places, with u id 
noise in the transmission link. Fi aaihion’ 
. Finally there are disturbing f 
moments applied to the load s cena 
ystem. The elements i 
degrees of inalterabilit 1 i Sante deere 
y. While a bigger and faster h i i 
better torque-to-inertia ratio i i HA ga a 2 
ja ratio in a motor-driven syst ie 
(o-noise ratio in a measuring i i ti ee 
g instrument is sometimes possible b i 
more expensive hardware, there is a limit Cie 
on what one can do b d 
cost consideration. Furthermore, i Taneen 
2 ( My , in order not to use expensive hard 
unne cessarily, there is a point in investigating what is the best o bie 
with the given hardware. ese 
s eign: the system controller with its amplifiers and compensating 
’ s, linear or nonlinear, is almost entirel i 
While an increase in gai ' i adidas sua ee 
} : gain that requires an additional i 
ae tha stage amplifie 
. ; * teat tall te it is nowhere nearly comparable to me pn 
a er hydraulic system. The noise in well-desi 
% -designed electronics i 
semi “n ein of magnitude lower than that of the measuring Sues 
ents and does not constitute a valid limitati rhe 
ated stitut ation on system gain. Th 
tale be amplifiers, it is by noise of various types, akan - 
‘viated by proper prefiltering. If an amplifier i | 
' plifier is saturated b - 
ling error signal before the c 1 ges 
ontrolled system is saturated i 
considerations usually dictate a geese 
change of the setu W 
way that, at least ins i ;: ine 
‘ ystems where high performance is i 
ape at a prem 
limitation of any kind should be imposed on the controller itself ney 


Chus V may classi y y 
MS Rhy ements nm a control 8 stem into three 


he . re A weet which is also called the plant, the fixed com- 
,, or the unalterable component, meani i ice j 
ee eae p , Meaning that its choice is usually 
2. The measuring or sensing elements 
3. The system controller 


ho first two i . 

: + oo two items are more or less given, and a servo designer’s job is 

: : _ y to design the system controller itself. However, that is only 
© first step. If we find it possible to design a controller te meet some 


“iven performance specification, the problem does not stop th 

are usually asked what is the best one can do vies the oA hard bes 
(items | and 2). Sometimes we find it very difficult to ae th ve ‘fied 
performance; then we must be prepared to answer the sestnekias x 
ill possible to meet the specified performance with the big waks on hand? 


a 
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Or, what improvement in the hardware is necessary in order to obtain the 


specified performance? 

1-3. Some Classical Fallacies. These questions are difficult or impos- 
knows only the classical techniques or theory. 
al notions of perform- 
t the abso- 


sible to answer if one 
Compounding this, we shall see that some classic 
ance and of what one can do with a system no longer represen 


lute truth. We shall confine our discussion to three areas: 


1. System response to input 
2. System response to load disturbance 
3. Effect of variations in plant transfer function 


Perhaps the best-known notion is the following: The most desirable 
system response is the one with the shortest rise time and adequate damping. 

The above is no longer true when noise is considered. A shorter rise 
time also implies a larger passband for noise, which increases the over-all 
error. Another point is that what one can obtain on paper is different 
from what one would obtain in an actual system, because of plant satura- 
tion and a number of other factors. A shorter rise time usually means 
larger transient as well as noise input to the plant for the same reference 
input. Once the plant is saturated, the system is likely to be more slug- 


gish than one which has a longer rise time on paper but operates in the 


linear range. A third point is that two systems with widely different 
shoot but a 


pole-zero locations can have the same rise time and peak over 
different order of magnitude of peak value of transient plant input. 
Considering the small-signal transient response alone is not enough, since 
a system with a lower peak value of plant input performs better when 
the reference input signal is large. 

The best response to load disturbance is the stiffest one. In other words, 
the most undisturbable system is the best. This is not always true, 
depending on the available torque, rate, or power of the controlled sys- 
tem. For instance, in a platform-stabilization system we nearly always 
nt torque in the torque motor to balance out the disturb- 
This is not so in the roll stabilization of an airplane or a 
gs so in a steering or attitude-control system. Conse- 
quently we try to balance out the disturbances as much as possible 
in a platform-stabilization system, the only limit to system bandwidth 
being instrument noise. In the roll stabilization of a ship we try to 
balance out nearly all the disturbing roll moment due to ocean waves 
when the sea state is not very heavy, so that the passengers can enjoy 
their voyage. However, in a really stormy sea, the roll moment is many 
times larger than the available stabilizing moment of the fins (or acti- 
vated tanks, etc.), and all one can do is to use the fins to damp out the 
esonant mode of the ship response to lessen the danger of 


provide sufficie 
ing moments. 
ship and is even le 


predominant rr 
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Prete Tf the servo is designed with unduly large bandwidth, it would 
a e€ paper tes would not serve any useful purpose a all. In 
stee and attitude-control systems, the rudd 
used to keep a mean course or atti Peesoonnsiel wepennnorwe A 
‘our’ attitude, as its moment and 
ment are not adequate to kee ‘ i mest See 
; ep up with the instant: i 
oe to ocean waves or atmospheric turbulence ee a 
D 1 : 
is ede etal she Sore oh point of view, because of load inertia 
ion or resonant effects, some Fouri 
- ; ourier compon 
oa hee disturbance cause far more change in the -diehed nae 
sete a : bi the sean forces or moments at our disposal are 
: roblem is to design not the stiffest 
tem but hich i 
properly selective, so that th he putleceal 
an ; ese forces or moments can be most effectively 
The closed-loop res 
ponse of a control system can be mad I 
Of e as ind 
dl pp er characteristics as one likes by using shunt iat . 
d increasing the gain of the inner loo Th - 
Se 00D. e problem of the sensitivity 
plant variations is one of th 
interest. An example of its i i ih oe ae 
res practical application is th 
ballistic missile. Within i Tg het Pee 
; a relatively short time of i 
the Mach number chan ese have ae 
ges from zero to 10 or high 
(hing undefined as the ai ahha, @ ee 
e air pressure changes from 1 atm t i 
_ o pract 
ok cone ve ere: Agia the mass changes as much as 10: | ware 
a ing shift of the center of mass. Obvi in. 
change in plant transfer char: isti , kay men ni 
acteristics; however, the closed-l 
response is required not only to be w saapeebs 
ell da 
ore specified performance at all times. he oe 
Ri eee would think that the problem could be readily 
g shunt compensaton with a large i i 
inner loop, can be stabilized by i i Sean ee 
op y introducing a sufficient numb 
: ' er 
a = Sonarig! of the plant poles and keeping the poles of the peter 
 Raiabat : | aang tee left. A close examination reveals that 
Sf nite relationship between the sensitivi i 
vystem’s responses to load di partenariat 
‘ isturbance and instrument noi i 
: on oar ap a possible existence of transportation lag or fest 
hase effect in the plant, impo imi i : 
Se vcthask, , imposes a limit on how far we can go with inner- 
: ect ee schemes such as conditional feedback or the use of 
iodel have been suggested, tried out, and i i : 
ahaa ted, tri , and published in the lit : 
Ah r, from be analytical point of view, they are not different eran. 
‘imple series-shunt compensation sys i 
é ! sat system. Th i 
rhe from a discussion in the next section Settee: tint none 
othe Relations of Linearly Compensated Systems. With 
0 lig. 1-1, a control system is called linearly compensated if the 
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It does not matter whether the controlled system 


controller is linear. 
In other words, we can 


and measuring instrument are linear or not. 
write an equation between Ry, Ci, and A: 


A(s) = Hi(s)Ra(s) — H2(s)C(s) (1-1) 


where s is the Laplace-transform variable. 

The above definition of a linearly compensated system can be general- 
ized to include systems that have nonlinear elements in cascade with the 
controlled system, since these cascade elements can be viewed as part of 
the controlled system. 

Our first basic relationship is that of equivalence: A linear system con- 
troller is completely specified by the transfer functions H,(s) and H,(s). 
Two system controllers with different configurations but the same H,(s) 


Fig. 1-2. Block-diagram representation of a linear system. 


and H.(s) are completely equivalent as far as every aspect of the system 
response is concerned (such as stability and responses to input, to noise, 
to load disturbance, etc.). 

When the problem is formulated in the form of Fig. 1-1, the equivalence 
relation is almost self-evident. The controlled system and measuring 
instrument are described by an equation relating ci(t) to all previous 
values of d(t), neo(t), and a(t): 

ex(t) = fld(t’),no(t"),a@”") #1 (1-2) 


There is no need to discuss Eq. (1-2) except to note that it is completely 
independent of the system controller. Because Eqs. (1-1) and (1-2) 
determine a(t), and consequently c(t), the dependence of the controlled 
variable c(t) on the system controller is only through the network func- 
tions H,(s) and H,(s). This proves the equivalence relation. 

In case the controlled system and measuring instruments are linear, a 


basié relationship exists between the sensitivity function, system response 
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to load disturbance, and s i 
; ystem respo i 

Se ee ponse to instrument noise. It can 
: os Fig. Ri G(s) and H,,(s) are transfer functions of the controlled 
sys em and measuring instrument, respectively. The load disturbance is 
I : presented by its equivalent value d,(¢) at the output end. Physicall 
dy(t) is the output variable if the control is completely cut. off The 
“98 ee 3 measuring instrument is represented by its ceaivaiont value 
no(t) of the measured variable. The closed-l 

: : -loop transfer functi 
ee C/D,, and C/N» are defined as the response in C due to R D, ne 

2 alone. From an inspection of Fig. 1-2, we have + 


Cc =e H,(s)G(s) 
Es ©) = TEHOF.WGe aii 
ye 1 
D, © = T FHWA) vo 
C () — _—Hals)Hm(s)G(s) 
N, ©) = T+ H@H.(G) ae 
lor an infinitesimal variation in G(s) or AG(s), Eq. (1-3) gives 
CG 
ee CO 1 
: (s) G(s) 1 + H2(s)Hn(s)G(s) seas 


‘he sensitivity function S is defined as 


G Gi 
as: (6) / 5 
AG(s)/G(s) pe 
i erent the ratio of per-unit change in C/R to per-unit change in G 
“a Biv es a quantitative measure of the dependence of the system transfer 
‘nection to the plant transfer function. Equations (1-4) to (1-7) give 


G: 
S= 5 (8) Ss 1+ 6) (1-8) 


| sae . the equivalence relation, Eq. (1-8) is completely inde 
wndent of the system confi i : 
aon y configuration. It holds as long as the system is 
" eeaness (1-3) and (1-4) imply that the transfer function C/R and 
4 1 can be independently selected. Tor any given G(s), we can find 
4 ae I to give the desired C/R and C/D. However, Eq. (1-8) shows 
ayy <p DARA of C/R to variations in G is closely tied to C/D, and 
» In order to make C/R insensitive to variations in G, the loop gain 
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H.H»G must be maintained at some fairly high value for all possible 
values of G. A sketch of the various gain functions is given in Fig. 1-3. 
The C/R function is approximately H,/H:Hm. The loop gain (H.H ma) 
varies as G varies and must be allowed to decrease gradually beyond C/R 
on account of system stability. Consequently, the system bandwidth 
to instrument noise is considerably higher than that of C/R, and the sys- 
tem’s stiffness to load disturbance is also necessarily high over a wide 
band. It isobvious that how far the noise bandwidth extends depends on 
how insensitive to plant variations the C/R function is made to be. 
Thus we see that instrument noise and load disturbance are limiting 
factors to the degree of insensitivity that one can achieve in the system 
response function to plant variations. Of course there are other limiting 


Gain, db 


with changing 


sus log w of various transfer functions in a system 


Fra. 1-3. Gain ver 
plant. 4 

igh-frequency resonant modes of the 
ffects are not very significant in the 
mit the extended gain band- 


factors, €.g., transportation lag and h 
plant. While these high-frequency ¢ 
passband of C/ R itself, they nevertheless li 
width of Hn»H.G. 


1-5. Scope of This Book. The above is an illustration of the various 


factors entering into a control problem and how they are interrelated. 
Present control theory does not give a cookbook solution of every problem 
that may arise, but it does provide a number of basic tools for solving 
these problems. To be more explicit, we have a number of idealized 
situations for which exact mathematical solutions are feasible. Some- 
times a control problem falls within one of these, and we have an immedi- 
ate solution. However, many problems do not fall into one idealized 
situation alone, and a direct mathematical solution is either too cumber- 


some or impossible with our prese 


nt knowledge of mathematics. The 
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sae NS methods that we use in solving idealized problems still can b 
vides EEA ceo effects of each individual factor involved and ne 
ome insight into what a good i 
ayy: good over-all solution should 
byes ° prs 2 ms aa our present control theory is not complete ie! 
‘ > thankful for this incompleteness, since wi i ' 
- e without 
sn would no longer be an interesting and challenging a ees 
8 . . : 
ge grag nee in Sec. 1-2, the optimum performance, in whatever 
sani ibs system may achieve is limited only by factors in the con 
ho sprigs or plant and in the environment of control, such as the 
eyes sige ni alte ot and noise, etc., and by the capiathleas 
f s variations of these factors. In th 
“ie i - In the subsequent chapt 
9 ane sa pnt methods of analysis and synthesis which print 
he opt performance and optimum di 
limitations. There are four pile ramtar anaes penadn aii ek 5 
1 ; . , . . . - 
4 ete 3 vn $1 i ayaiee with quadratic constraint, which 
8 the most useful for li i i 
aber or linear systems with or without random 
Seren a : sods 
ee ee s maximum principle for optimal control of a nonlinear 
e : e sense of minimal time or maximum range. A well-kn 
erunner and special case of the maximum principle is the ‘“b ne” 
servo principle. ee 
3. Self-optimizi i 
lat ptr arate heme or adaptive systems. A system of this kind 
sures, s, and adapts itself to changing pl i 
4, Optimization and cont: ced mpbethtieiy ee 
rol by the use of a digi ich i 
oo or less a catchall when everything else fails, seb sy 
ear a two essential phases of operation of a digital computer in 
at a ee of the optimum performance and the optimum 
setory an actual control of a syst ilizi 
eae lof a system, utilizing the computed data 
g and a digitized version of th i incipld 
represent the basic mathemati i isinw cotdpat ae 
— es upon which different computation pro- 
Besides the four basic m i 
ethod izati 
ae sages alley s of optimization, two closely related sub- 
i alysi 
.: ce ne a of random processes. The concept of 
’ , correlation functions, and spectral densi te 
lion and measurement of th ay tnapiy pony eavsiel oiae 
oot ese functions are an essential part of control 
” yovet 3G ] 
Sie = bs analysis. An analysis of the errors or discrepancies caused b 
| a ” ” inaccuracies on the closed-loop system response is as im a 
wut a factor in control engi i i i ie 
edi gineering as specification of tolerances is in a 
& «foe methods are useful not only for the design of complex s 
iw but also for routine design of relatively simple apeterak: one 
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instance, a graphical method realizing Wiener’s optimization with quad- 
ratic constraint is the root-square-locus method. Its complexity is not 
much greater than that of the root-locus method, and it can be used in 
lieu of the latter. However, it gives directly from the plant transfer 
function and specified rise time or servo bandwidth the compensating 
network that has the least tendency to saturate the plant, among many 
that can be used to meet the performance specifications. Thus the 
method is an improvement over conventional linear design techniques. 
Similarly, the concept of “bang-bang” servo can be used to help select 
nonlinear compensating elements for systems which are easily saturable, 
although a complete realization of the bang-bang principle is difficult for 
high-order systems. 

1-6. Nomenclature. There are some symbols and conventions which 
are used throughout this volume. The lower-case letters represent time 
functions. For instance, r(t) represents the reference input, c(t) the con- 
trolled variable, i(f) an unspecified input function, a(t) an unspecified 
system variable, and ¢(t) the correlation function. Corresponding capi- 
tal letters represent Laplace transforms of the time functions, for exam- 
ple, R(s), L(s), G(s), ete. G(s) and H(s) represent open-loop system 
functions and transfer functions of components, with g(t) and h(t) as 
the respective impulse-response functions. Pertaining to sampled-data 
systems, an asterisk indicates sampling, for instance, r*(t) is the sampled 
r(t), and R*(s) is the sampled transform, or transform of r*(t). R(z) 
means the z transform of r*(t). An asterisk over a complex number 
indicates conjugation. 

A bar over a time function represents averaging. A bar over a capital 
letter represents changing s to —s or 2 40i25 o 

There are some symbols which are used consistently in the optimiza- 
tion procedure: Y(s) and Z(s) are defined in See. 2-4. The symbols 
[ ],and{ }* are defined in Sec. 2-4. 

The other symbols are defined as the occasion for their use arises. 


CHAPTER 2 


OPTIMUM SYSTEM WITH DETERMINISTIC INPUTS 


2-1. Intr i i 
oduction. In this chapter we shall start with an exact solution 


of the problem of s izi 
ynthesizing a system with | i 
Reged thesizi east-int - 
ou er lets ote! limitations of the controlled had eae ss: 
“ Aa ili of not have wide application, its usefulness is patie i 
. quen sections to the design of systems which meet th ; 
oa see propa for response time, peak overshoot, ete ne teal 
kK aturation. The application of dis | a 
3 \ the method i 
( hie oe the graphical root-square-locus sa ee eee 
‘ e application examples are gi i 
tae given in the chapter to il 
* Oe ee sie its scope of application. It is arirating eae ee 
ae met od can be applied to all design bl ae 
ot-locus method is applicable civ eteeptiagia = 
2-2. S ; 
Tt ppt ate Least-square Optimization Problem. For th 
“hed ae - , the controlled plant with transfer function G( 7 
t st inear controller with unspecified configuration and ¢ 4 I 
; ontro 


Linear 
controller 


c(t) 


Rea ? P 
1G. 2-1. A linear system with unspecified controller 


eloments is to be designed i iteri 
sang gned. The design criterion is based on two figures of 


as I he integi il-sc uar [ b 0 put r(t applied at 
) € ib a y 
i 0} ’ 1 e error {su sequent t an in ( ) 


Ti = fy (ey dt = f° — e(oy at (2-1) 


¥. The integral-s 
» integral-square value of the control effort: 


Jy= i i [i(e)]? de (2-2) 
1 ; 


—_— 
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The significance of J, is as follows: 

1. Any error, positive or negative, contributes to the integral-square 
error a positive amount which increases with the magnitude and duration 
of the error. 

2. Bither a slow-rising characteristic or a large overshoot would 
increase the value of the integral-square error very significantly. 

By limiting or requiring J; to be a minimum, we are, in effect, putting 
somewhat, elastic restrictions on all three factors: response time, settling 
time, and peak overshoot. J: has similar significance relating to the con- 
trol effort. 

The design criterion may be either of the following: 

a. Minimizing the system error within the limit of available control 
effort: 

Ja = min: (2-3) 
Jo < Kz (2-4) 


b. Minimizing the control effort with the tolerable system error 
specified: 
J = 10m (2-3a) 
Ji< Ki (2-4a) 


There are two steps in the solution: 


1. Determination of the optimum control ratio (or closed-loop transfer 
function) (C/R)(s) from the design criterion, G(s), and r(t) 
2. Selection of a system configuration and compensating elements to 


give the optimum control ratio C/R 

The second step is more or less routine and usually can be done in many 
ways. 

2-3. Transformation into Frequency Domain, Parseval’s Theorem. 
To facilitate mathematical manipulations we shall convert Eqs. (2-1) and 
(2-2) into the s domain. Let a(t) be an arbitrary function which is zero 
for all negative values of t, is bounded for all values of t, and approaches 
zero at least as fast as e-*, as t approaches infinity, where e is a small 


’ 


positive constant. We shall prove that 
ie [a(l? dt = 5 Ns A(s)A(—8) ds (2-5) 
0 wT] J-i* 


where A(s) is the Laplace transform of a(t). 
The above relation can be proved as follows: One of the a(t)’s on the 
left-hand side of Eq. (2-5) is written in terms of its inverse transform: 


| * taped = os | ” a(t) | 7° A(s)ett ds dt (2-6) 
0 om) JO jo 
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€ t e sta 1 Eq. 2- can be integrated f rst with 
l nd T h ted cond tions for a(t) ( 6 

’ ) 
respect to is T he result 18 Eq. (2-5). The conditions on a(t) define a 
common region of conver gence for the Laplace tr ansfor ms A (s) and 


A(—s), as shown in Fi 
’ oo! oa ig. 2-2. We not i : 
irnaginiaay alain lier Uuniaehe ote that integration of s along the 


Overlapping 

region 

9$SQQAAA KXSXESRSY 
N Region of Rao Set pr gale 
SX convergence eetoae ence: a. S 
N - Benes convergence % 
NO. Z! Z 
pee |, alte tdeZ 

0 

Z 


CEO OOO ON 
senasonasocerocatoatonat 
sectatatatatate wore 
ve" x 
Mecgtetetgtcececs S389 
waseratorateratatene 
PRR RRS 


Se 


0; 


0, 
~ 


RES 
otetes 


* 


OS 


\ \\ 


F . : 5 
1G. 2-2. Overlapping regions of convergence on the s plane 


pS 
FS 
b 
pS 
ss 
bs 


RS 


Lot ’(s) denote the control ratio (C/R)(s). 


e() = r() and 7(é) can be written as pee ee 


C(s) — R(s) = [F(s) — 1] R(s 
16) = £8) = PORE) ‘ ie 
Gs) ~ —G@) ae) 


I'rom Eqs. (2- 
om Iiqs. (2-5), (2-7), and (2-8), Eqs. (2-1) and (2-2) can be written as 


J; =- ats Ss F — 
Sai | je MCS) — UN(—8) — 1R@)R(—s) ds (2-9) 
and Jp= at, [* FORK a)R(s) R(—8) 
2} } -j-2 G(s)G(—8) ds (2-10) 


4-4, Soluti 
olution of the Least-square Optimization Problem. Following 


Lawrange’s me the 
, . . | 
of a “ / B od of undetermined multiplier 8, Kqs. (2-3) an (2-4) can 


Jy + k’J_ = min (2-11) 


Nayiation (2-11 
| 11) ¢ feces P 
) can also be justified by simple reasoning. For each valu 
, AC e 
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. When 
f k2, there is a control ratio that satisfies Eq. (2-11), i : Ms pee 
cobeninted into Eq. (2-10), each F'o(s) bis nal ae il ne Rep. 
1 at yl 
i Josed-loop transfer functions : Ea, (2-11). 
: Sept the smallest value of Ji because F(s) satisfies Eq. ( 
2, 0k 


. lo west TJ 
To determine I o(s) we simply choose the one that gives the ’ 
’ 


’s satisfying Je < Ko. oo 
sgt toate a vee ‘nterested only in stable systems, and a age meet 
F(s) aa all its poles be inside the LHP (left half plane). 

itten as me 
F(s) can be writte coy Meus He : 
i f 
i i fs. Since all the poles 0 
i tant and F',(s) is a function 0 aloes 
es eee tae in the LHP interior, all the poles of F AY ne is hes 
Auge Laer On the other hand, given any F aga oy wl 
eH | i F(s) of Eq. (4 
interi d arbitrary constant A, : 
a ieagaieands Sao and is a candidate for the optimum system func 
. . . is 
tion F (s). Changing the sign of s in Eq. (2-12) gives 
F(-s) = Fo(—s) + MF y(—8) : 
i i he short- 
the following mathematical manipulations, we shall ape Farin 
h ee one Fo, Fi, R, and G to represent F'o.(s), F,(s), R(s), 
and no Fi, BR, t 


espectively nd the shorth nd notat ons /| I R and G to represent 
a a 1 Oks , ’ 
7] , 1 


Fou(—s), Fi(—-8) R(—s), and G(-s), respectively. Evaluating Eqs. 
=O) ee od) ae ’ 
(2-9) and (2-10) for F (s), we have 


+ = - + — 1)\(Pu + Pi — DRE 
(J kJ 2) F< ) p | re MF, 1) (I 0k DNL 1 ) 
1 8 


ie + My (Pat MF ne) ds = Jet+NJo+Jo) + Sa (2-14) 
; “ a FF wears RR ds 
where i= 35] .[¢*- 1) (Fox ey aa GG 
dm ij inp | fo tee al RRP, ds 
Sith oa eP | 2 a | RRP, ds 
Kua ay) iz ( es - FFARR ds 


. func- 
In Eq. (2-11) Ja is the value of Ja + kJ» with eet he pied 
1 ee ss ti bstituting —s ) 2 
‘ always positive. By su sient condition 
ee Tt Be abut to the latter. Hence, Be: Saat arbitrary 
for PF to give the lowest value of J; + kJ. is simply ) 


Pi, Jo =O. 
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In the following we shall show that this is also a necessary condition. 
If J; # 0 we can always choose a different » satisfying the following 
conditions: 

|. The magnitude of ) is small enough so that 274 < 2|dJ. als 

2. The sign of is such that J, < 0. 
‘Then for the F(s) defined by Eq. (2-12), with the new value of \, J; + k°J2 
is less than that for Fo.(s), a result which contradicts the definition of 
Pox(s). 

Now let us consider the equation 


ai 
[ | (re ay ae al RRP, ds = 0 (2-15) 
—jo 


ince all the poles of F1(s) are inside the LHP, all the poles of F; are inside 
the RHP (right half plane). It 


suggests a solution that all the 
poles of [ (Fox = 1) + °F ./GG|RR 
he inside the RHP. We shall 
romplete the path of integration 
of Inq. (2-15) by a large semicircle 
‘0 the left, enclosing the entire 
LIP, as shown in Fig. 2-3. Since 
all the poles of the integrand are 
inside the RHP, the contour inte- 
“ral vanishes. The behavior of the 
\nlogral at infinity remains to be 
pxumined. From the initial-value 
(heorem of Laplace transforms we 
way evaluate the limiting values Fic. 2-3. Path of integration and loca- 
mW the reference input r, controlled — tions of poles of #1(—s) and X(s). 
yirinble ¢;, and control effort 3, 

| wn infinitesimal interval 0+ subsequent to the step input: 


s plane 


Poles of F, (—s) 
and X (s) 


r(0+) = lim sR(s) 
c(0+) = lim sF(s) R(s) 


d : F(s) R(s) 

Fe ea RR 

Ihe above equations are valid for the optimum system function Fo(s) 
i well as for any competing system function F(s). Since r(0+), c(O+), 
0 }) are either finite or zero, the transform functions R, FR, FR/G, and 
their barred functions are all of the order of 1/s. The integrand of Eq. 
(7-15) is made up of products of these factors and is therefore of the order 


———————— << i“ | 
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of 1/s%. The semicircular part of the contour integral converges to zero 
as the radius approaches infinity. It follows that 


| Fa —-1)+ a | RR = X(s) (2-16) 
GG 

is a sufficient condition for Eq. (2-15). Here X(s) is used to denote an 
arbitrary function of s with all its poles in the RHP interior. 

Equation (2-16) is also the necessary condition for Eq. (2-15), because 
of the arbitrariness of F;. If the expression (Fo — 1) + ker o./GG\RR 
has any poles in the LHP, a function F, could be selected so that the sum 
of residues would not vanish; Eq. (2-15) would then be violated. Thus 
Eq. (2-16) is the necessary and sufficient condition for Fx(s) to yield the 
minimum value of J1 + kd 2. 

Equation (2-16) can be rewritten as 


* | es e. 
( fe =| RBFo — RR = X(s) (2-17) 


The function 1 + k2/GG is not changed by replacing s with —s. Hence if 
a is a pole (or zero) —@ must also be a pole (or zero). Consequently, 


the function can be factored into a function Y(s) and an identical function 
Y(—s) such that all the poles and zeros of Y(s) are in the LHP: 


1+ a = Y(s)¥(-8) (2-18) 


This process of factoring into two functions according to the pole-zero 
locations relative to the j axis in the s plane is called spectral factoring. 
The function that contains all the poles and zeros in the LHP (RHP) is 
denoted by braces { } around the original function and a plus (minus) 


sign at its upper right corner. We can write instead of Eq. (2-18) 
Y(s) = (! + at (2-19) 
GG 
Similarly, we can define a function Z(s) by 
Z(s) = {R(s)R(—8)\* (2-20) 
In order not to break the continuity of the treatment, it is assumed at 


this point that the functions inside the braces do not have any poles or 
zeros on the imaginary axis. This restriction will be removed at the end 


of this section. 
Substituting Eqs. (2-19) and (2-20) in (2-17) gives 


YYZZFu — ZZ = X(s) (2-21) 


Oo 
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Dividing Eq. (2-21) by YZ gives 


YZ ($93) 


fi p p os p . 
| he inction Z/Y can be ex ressed as a par tial fraction expansion 


4 = » A; + B; 
- isis is s— B; (2-23) 


: j 


In liq. (2-23) the a;s ar ? B; p 
4 e poles in the LHP and the sS are oles In { I 1e 
Le n ic efine represen he roces f 1a 


lractioning, retainin 
ue g only the terms wi 
respectively. Then, by definition s with poles in the LHP and RHP, 


LF, p » 8 : os (2-24) 
| P|. = >, res B; (2-25) 


Nquation (2-22) can be written as 


YZFu — B =e 
rl, vat lel ee 
ince all the poles of the ex 1 
7 pression on the left-h i 
are in the L ee : 
am . ~ HP and all the poles of the expression on th i . oer . 
1c RHP, we obtain as its solutiont 2 Figg ar o> 
NA ir E 
an (2-27) 
hae ote 
wake (2-28) 
tivo 1, is determined, J 
sade ai and J» can be eval i 
i rule, J; is an increasing function of k and J : ei lessee . - 
netion 


! The two sides of E 
, § q. (2-26) are e ] 
| land ; ? qual to an unknown function of s which is analytic 
hort a Paes Therefore it must be a constant To show that the constant 
“, we shall examine the limit of the function , 0k 
YZF , as s appr oaches infinity : 


You Z—R 
Therefore YZFu— k Bro = kr = *O+) 
8 


Min " h ) VA | on 4 re of e order of r Ss 
me | 1 " and Z y 
rehetant muat be [ / ]4 are th » 0 d ro 1/s as 8 app oaches infinity the 
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of k. The value of k is therefore given by 
J(k) = Ke (2-29) 
The above gives Fo(s) for the optimizing problem defined by Eqs. 
(2-3) and (2-4). The problem defined by Eqs. (2-3a) and (2-4a) can 
be solved in exactly the same way. Equations (2-11) to (2-28) are 
equally applicable to the latter case, while Eq. (2-29) is replaced by 


Ji(k) = Ky (2-30) 


Example 2-1. An inertial load is to be driven by a servomotor through a gear 
train. The transfer function of the motor and load system can be represented as 


C K 
o@) = OS = as (2-31) 


where c(t) is the load-shaft position in radians and i(é) is the control winding current, 
The value of the constants are (referring to the load shaft) 


Kr = 2 newton-m/amp 
M = 0.2 kg-m? 
R = control winding resistance = 8 ohms 


Under the heaviest-duty condition there is, on an average, one step input every 2 se¢ 
with an rms amplitude of 0.5 radian. The amplitudes of the successive step inputs 
are independent. In order to keep the motor-temperature rise within allowable limits, 


the average copper loss of the control winding should not exceed 10 watts. Design a 


system controller which gives the least-integral-square error. 


Solution. Since the successive step inputs are independent, their cross terms in 
the integral-square values of e(t) and z(t) average out, and these integral-square values 
can be calculated individually for each step input. In order not to exceed 10 watts in 


control-winding copper loss, we must have 


5 Ri}? dt < 10 watts X 2 sec 


or Jy” wor a < 296 = 25 (2-32) 


Equation (2-19) gives 
k?s4 + 
y(o) = {1 + Wat\* = 1 + as)(1 +b) (2-38 


where a = V/(k/10) /45° and b is its conjugate. 

Now R(s) = 0.5/s and has a pole at the origin. However, for all practical purpos 
we may write F(s) as 0.5/(s + e), where « may be the reciprocal of a billion years 
While an exponentially decaying function with so long a half period is practically th 
same as a step function, it nevertheless satisfies the mathematical requirement of no 
having any poles or zeros on the imaginary axis. Equation (2-20) gives 


Zs) = {**. 0.5 \ - 0.5 (2-34 


Z 0.5 0.5 
[sl], a [a + e)(L = as)(1 = |, “WT+ajd +b) +0 
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Substituting Eqs. (2-33) to (2-35) in (2-28) gives 


Fa = : 
(1 + as)(1 + bs)(1 + ae)(1 + be) 


Au « approaches zero, it becomes 


ry 1 4 1 
(I + as) + bs) ~ 14 a/0dks + Oks? si 
Substituting Eq. (2-36) in (2-10), we have 
@ < ie ue SS —0.0025s? ds 
7) J —j0 (1 + V/0.2k s + 0.1ks*)(1 — +/0.2k s + 0.1ks?) Ga) 


* a — e this eran oe be evaluated by contour integration. However 
: ecomes rather tedious when the order of the s is high, 343 

: N ystem is high i 
to use the Table of Integrals given in Appendix D. For the ikere tnlobad eae 
ave 


n=2 a2 = 
ay = 0.1k bo = —0.0025 
a = V0.2k b, =0 

TS na pea ees 


2X0.1k xX VO2k ~/80k 
lherefore k = 0.05. Substituting this value of k in Eq. (2-36) gives 


Pier ee ea eo 
o8) = 7FOis + 0.0058 ~ RK” (2-38) 


Ihe control ratio of Eq. (2-38) can be readily realized by a system employing tachom 


stor feedback, as shown in Fig. 2- i i 
ok . in Fig. 2-4. The integral-square error is evaluated from Kq. 


1 fie (0.18 + 0.005s?)(— 
aS .005s?) (—0.1s + 0.0058?) 0.5 
2nj J 72 (1+ 0.18 + 0.005s4)(T — 0.18 + 0.0055") s \~ a “ 
3 et 0.0025(1 — 0.0025s%) 
Qnj J —jo + 0.18 + 0.005s4)(1 — 0.18 + 0.00584) 


“xoxo, =~ (2-39) 


| In ee above example, we have considered poles at the origin as poles at 
' in the calculations and let ¢ approach zero in the final result. Exact 
ihe same results are obtained if, without using ¢, we split these ae 


Amplifier 
gain=20 


Tachometer 
H(s)=0.1s 


Fra, 2-4, Block diagram of an optimum system. 


ee 
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equally between Z and Z and treat the ones belonging to Z as if they were 
in the LHP and the ones belonging to Z as if they were in the RHP. The 
latter method has the advantage of simplicity in algebraic manipulations. 
In case there are poles and/or zeros at other points on the imaginary axis, 
they can be treated in the same way. 

Quite often the input to a system can be described as a random com- 
bination of step and ramp inputs or some other deterministic functions, 
and it is desirable to have the averaged values of J: and J» satisfying 

J, = min 

J 2< Ke 
or vice versa. The averaging process is weighted according to the fre- 
quencies of occurrence of the various types of inputs. The solution is 
the same as above except that Eq. (2-20) is replaced by 


OL = ss wiR:R; (2-40) 


In Eq. (2-40) wi is proportional to the frequency of occurrence of the 
ith type of input Ri. 

Another point worth mentioning is that the transfer function G(s) is 
that between the output variable and the system variable the integral- 
square value of which we wish to limit. The latter is not necessarily the 
conventional input variable. 

9-5. Some General Results. Let us examine the expression on the 
right-hand side of Eq. (2-28). As Y does not have any poles in the LHP, 
the poles of the function [Z/Y], are the poles of Zonly. In the complete 
expression these poles are canceled out by Z in the denominator. There- 
fore the poles of the optimum control ratio are the zeros of Y and Z only. 

For the two commonly used deterministic inputs, Eq. (2-28) can be 
put into even simpler form: 

1. Step Input 


R@) = 4 
Pa) = re) Fear = ro Or ‘- POS C4 
2. Ramp Inpul 
R(s =4 
Fuld) = Fe) aa = lero 5s Bod 
1+ Ts a 


= Y(0) Y(s) 
where T = Y’(0)/Y(0). 
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The constant Y(0) can be determined by letting s = 0 in Eq. (2-18): 
k? 
YO)? =1+—2—7, 
| [Y(0)] + Gop (2-43) 
If the transfer function G(s) has a factor s” (where n # 0) in its denomina- 


tor, 1/G(O) vanishes and Y(0) = 1. E i i 
Be (0) : quations (2-41) and (2-42) give, 


a pce 

Fo(s) = Ys) for step input (2-44) 
1+ Y’0 

Fou(s) = ip for ramp input (2-45) 


Iquations (2-44) and (2-45) will be given a physical interpretation. An 
over-all open-loop transfer function Go(s) is defined by s 


5 (s) at G(s) 


1 + G(s) ns! 
‘or an optimum system, 
1 cease 
Gir Fay 1 (2-47) 


We shall examine the behavior of G(s) for small values of s. The func- 
lion Y(s) is expanded into a power series in s: 


Y(s) =1+ Y'(O)s + }Y"(0)s? + --> (2-48) 
lor the step input, Eqs. (2-47) and (2-48) give 
1 — 14 uv 
Gols) = Y/Y (0)s + YY (0) s? aie fate (2-49) 
lor the ramp input, Eqs. (2-47) and (2-48) give 
Dohin ptis MeY"(O)s? + +: 

Ge) I+V@s 1” T+Y'Os a 
om pacing powers in Eqs. (2-49) and (2-50) are s and s?, respectively. 
erefore, assuming G0) = 0, the optimum compensated system is a 
ype \ system for step inputs and a type 2 system for ramp inputs. The 


aplimum C/R is 1/Y(s) for the former case and (1 + T's)/Y(s) for the 
latter case, and T is selected to make the compensated system a type 2 system 
If G(0) is a constant, the control system is a regulator system and is 
inherently incapable of following a ramp input indefinitely. The onl 
‘euningful problem then is the one with a step input. Baudtion (2-41) 


shows that /o.(s) has the same : 
s that F'o(s) has the s¢ : poles and zeros as 1/Y(s), and E ~ 
wid (2-43) give the d-e closed-loop gain as Set a ae 


iy" = ss 
PaO) rar (2-51) 


With /’(0) as given in Eq. (2-51), the steady-state value of ¢? + ki? 


asi 


———— <<< — “~~ 
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is easily shown to be a minimum. We shall leave the proof of this point 
to the reader. 

2-6. Systems Meeting Conventional Specifications with Least Likeli- 
hood of Saturation. The preceding sections give an exact solution of the 
optimization problem as formulated in Sec. 2-2. However, very few 
control problems are so formulated. The error criterion is usually not 
the integral-square error but a few somewhat arbitrarily defined descrip- 
tive parameters such as response time, settling time, peak overshoot, 
error coefficients, ete. The input functon is known only in some approxi- 
mate sense, and the limit on the control effort is usually not the 72R loss 
but a nonlinearity in response such as is caused by saturation, range 
limitation, etc. 

As we know, classical design methods do not consider the saturation 
problem. While it is possible to make new assumptions and rewrite the 
saturation effect and other design conditions into exact form so that an 
exact solution can be made, the solution is usually quite tedious and one 
wonders if it is worth all that effort. Presently we shall describe a 
method which does not give an exact solution of the saturation problem 
but is simple and is an improvement over the classical methods in that 
the saturation effect is at least considered in an approximate sort of way. 

The method is essentially a reinterpretation of Eqs. (2-3a) and (2-4a). 
The specified response time determines approximately a value of Ji. 
Tf the saturable system variable is taken as i(é) in Eq. (2-2), the likelihood 
of saturation is reduced by minimizing J2. We see that the design prob- 
lem is then the same as the one defined by Eqs. (2-3a) and (2-4a). How- 
ever, instead of determining K, from the response time and k from Ki, 
k is determined directly from the response time, as follows: In an approxi- 
mate sense, the response time is inversely proportional to the distance of 
the nearest pole to the origin. Of course, it also depends on the number 
of poles at the nearest distance and on the locations of the other poles, 
etc., but as a first approximation these effects can be neglected. Sup- 
pose that we define the response time ¢, as the time it takes the system 
output variable to rise to 95 per cent of its equilibrium value after the 
application of a step input. We find the following approximate relation 
quite useful: 


(2-52) 


tp = 


ela 


The requirement on response time can be satisfied by selecting a k value 
such that the nearest isolated pole (not considering the poles in dipole 
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the conventionall v 
y allowed 10 to 20 per cent. This i 
ws ally all 3 : is is an 
: te: ei minimizing the mean-square value of 7(t). ag seg i: ia 
: mere: of the closed-loop system is usually not that of the ee a, 
takes an excessively large control effort to force the plant i te 
ay ee plant into an oscilla- 
If exceptionally low over i i 
shoot is required, it can b i 
‘ e ach i 
| ng the pole-zero locations of the optimum system, as eae ad anes 
ie next section. bees) 
2-7. The Root-s 
quare-locus Method. Present] 
ane cus , y we shall 1 
NS paccctie for determining the control ratio of an fein ie : 
let - Pty ave shown that, with step and ramp inputs, the o mae 
4 rol ratio can be readily calculated from the poles and We of (s). 
seen Y(s) is determined, the problem is solved a ae 
— ' . ior chapter, we shall deal only with rational transfer func- 
o not have any zeros in the RHP and leave the problems of 
0 


(ransportation lag and nonmini 
imum i 
l'resently we can write G(s) as ae a 


K IT (s = 2) 
Gis) wee | US 
- (2-53) 
Il (s — pi) 


| ‘ 2 , > 
’ 4 “ey wee Zeros of Y. Let Q, Pi, Qi, and Z; represent —s?, —p,2 
j", and —2;?, respectively. Equation (2-18) gives Bie a 


x hs I (s — pi)(—s — pi) 
| +: K2 m 
I] (s — 2)(—s — 4) 
gut 
a k2 I ciiidinald 2 II Coty 
=t% ye eaieanataatinade Ts a re — 
I] (2;2 — s*) (Q — Z;) 
j=l j=1 


(k?/K*) [| (@-— P) + II (Q — Z;) 


= doar! 


ete ; m (2-54) 
pairs) is about +/t» radians/sec from the origin. [L.0.— 2) 
To keep within the limit of specified peak overshoot is not a problem. j-1 4 
We shall show in See. 2-8 that the peak overshoot of a system designed by The roots of 
; ’ y Q of the 5 ’ ; 
the present method generally has a relatively low value and is well below 1¢ numerator polynomial on the right-hand side of 


Wy, (2-54) » determi 
| 1) can be determined by the root-locus method, in which we take 


———O———  —& 
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[1 (Q— z;) | , [ I] (Q — Pi) | as the open-loop transfer function, and 
jel i=1 
K2/k? as the variable gain. The closed-loop poles so obtained are the 
Q,’s. Since the highest power of @ in the polynomial is ”, there are n Q,’s. 
It is also clear from Eq. (2-54) that 2;’s are poles of Y(s). Once the q’s 
are calculated from the Q,’s, Y(s) can be readily written as 
Y(0) [J G@ — s/a) 

i=} (2-55) 


Y(s) = =m 
I] G — 8/2) 


3 


By substituting Eqs. (2-43) and (2-55) in Eqs. (2-41) and (2-42), respec- 
tively, the optimum C/R for step and ramp inputs is readily determined. 
We note that in the above method the only step which requires some 
effort is the determination of the Q;’s by the root-locus plot. Since the 
plot is made in the @ plane, it will be referred to as the root-square-locus 
plot. Its advantage over the conventional root-locus method is that it 
starts with the pole-zero configuration of the fixed elements without speci- 
fying compensating networks and gives the closed-loop pole-zero con- 
figuration of the optimum system including the compensating networks 
for all values of gain. A design can be carried out in the following steps: 
Step 1. Given the transfer function G(s) with zeros @1, 22, +++» 


Ce a Ta and poles pi, Pz, + + + » Pa where n > m, calculate 21, 22, 
2m and pr, D2, + + + » Put 
Zi = —2;7 
Ra= —pi 


Step 2. Plot the root locus of the new function with Z;’s as zeros and 
P?sas poles. All such available means as the potential analog, a Spirule, 
ete., can be used. Corresponding to the same “gain,” the set of points 
on the various loci constitute a set of root squares and will be denoted as 

iS. 

Step 3. Select the Q,’s according to system gain or the desired high- 
frequency performance. There are two typical ways of specifying the 
latter: system bandwidth, #1, or response time, tp. If the bandwidth is 
specified, we shall choose the Q;’s so that the ones nearest the origin are 
at a distance somewhat less than 12. If the response time is specified, 
the nearest Q,’s should be at a distance of approximately (x/ty)?. An 
exact determination of this distance is unnecessary in most cases to begin 
with, since the system bandwidth and response time are not precisely 
defined, If a more precise specification is to be met, a second or third 
trial may be indicated, 
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Ste : 
ose es anna the closed-loop transfer function 
on of the Q,’s, the closed-loop poles are calculated as 
G@=+V/-Q 


£ f 
Che sign oO the radical 1S SO selected that the re ] component 6) qd: 18 
a f 7 


After the 


Tl . . . . 
1e control ratio which gives optimum step-input response is simply 


Il G - 8/2) 
A Se 
R fe (2-56) 
A I] (1 — s8/q) 
tied 


where A = 1 + k?/[G@(0)]?. 


Step 4a. If optimum 
perform: : : : 
new zero should be added to Eq. ene eo OR RE eee 


(1+ Ts) [] (1 — s/z,) 


= (s) = = 
, . (2-57) 
I] @ - s/q@) 


t=1 


Se Die ee 


Slep 4b. i i 
; i. x Bphictivtinciel if a larger velocity-error coefficient is required 
One ei: : origin can be added. The presence of the divele 
et ws ose enough to the origin, does not affect appreciably 
ey, ms nse. The physical justification for such a procedure 
Bs ccs ee EL response if there are also present other 
Oe vrcice ave a heavier concentration of power in the low- 


Lot qa and zg re 
af present the pole and zero of th iti i 
\, represent the desired velocity-error soceae. ee org 


(-2)-(-3)-YCD-YCD-k eo 


an] ) Oe ati . 

un ‘ wel equation gives one relation between zz and qa and leaves thei 
in 0 « ae eaves t 
# Aas ition to be determined by other considerations, such as th a 
- ‘ est range of the low-frequency input The mite aa 
» tnd qa can be obtained by specifyi ; vaginitis 

ane spec SPyeeey . ; 
isin later. Yy specifying the spectral density, as will be 
Nlep de, 


While the peak ; 
et the peak overshoot of a system designed by the present 
, In some exceptional cases the allowed peak overshoot may 


<< UU 
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be even lower. One way to “doctor” the optimum quan ? series awh 

magnitude of the characteristic frequencies unchanged, 2 ile — ° % 

of the characteristic phe ee rig =, oe ae 
i factor h < 1. is clos j 

Be cer ton its minimum value. A method ha nS ms 

increment of J: + k?J2 due to small deviations in o(s) is hi in an 
Step 5. Determine the open-loop transfer function. ‘ e 7 

transfer function Go(s) can be determined from F(s) as follows: 

Yee 

Go(s)  (C/R)(s) . 

Once Go(s) is determined, the compensating network and ened ae 

be obtained as the ratio Go(s) /G(s), where G(s) is the transfer func 

the fixed elements. 


+P ‘ ; ee 
Example 2-2. A hydraulic system for position control is to be eet ue 
response to a step input, it should have a response time of approxim i 


(2-60) 


w* plane 


j2 


Real axis 


@, 


i ted 
Fic. 2-5. Root-square-loci plot of an optimum system. P,, Pa, . 3 ae 
open-loop poles and Q:, Q2, Qs; are compensated closed-loop poles 1 


with a peak overshoot of 15 per cent or less. The transfer function of the hydraulie 
valve and servomotor system is “ 
@ = iF 0AG/an) + C/o) 


ot—A 

i 2 s author, Root Square Locus Plot 

is e le is taken from a paper by the au J] Lo i 
Wisantieg & 1 for Synthesis of Optimum Servo Systems, JRE Conv. Record, 


Geometrical Methoc 
1058. 
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where K = 200 radians/(sec)(in.) and w; = 157 radians/sec. Since the saturation is 
likely to occur at the hydraulic valve, its input is to be made as low as possible. The 
(ime constants of the solenoid and torque motor for actuating the hydraulic valve 
can be neglected. 

Solution. There are three poles at 0 and at (—0.2 + 70.98)w:, respectively. In 
plotting the root-square locus, it is expe- 
dient to use w,2 as the base dimension. J axis 
l'rom step 1, 

P, =0 
Pos = —(—0.2 + J0.98) 2x 1? 

= (0.919 + j0.392)w,? 


The positions of P;, P2, and P; are shown 
in Fig. 2-5. Taking Pi, Ps, and P3; as 
open-loop poles and plotting the root loci 
«is done conventionally, we obtain the 2 3 
three root-square loci PiQi, P2Q2, and : Real axis 
/’,Qy. In order to obtain a rise time of he 

Approximately 0.01 sec, the minimum dis- q, 

(ance is found to be (7/0.01)? = 4w,?.. The x —j2 x 

Hearest root ison the branch OA. There- 
fore Q, = —4,? and, correspondingly, 
(Jy = 4.840126754-2", Qs = 4.84e12e7 754.2", 
‘Taking the square roots of Q:, Qs, and 
(J) wives the six pole locations shown in 
Viv, 2-6. The ones in the LHP are the 
polos of the optimum control ratio: —2w;, 2.2w,ei1171°, 
obtain from Eq. (2-56) 


Fig. 2-6. Diagram showing selection of 
closed-loop poles of the optimum system. 


There are no zeros, and we 


e ioe 4 
R (L — s/qi)(1 — 8/q2)(1 — 8/qs) 
— 1 
~ (L +0.5(s/oi) 11 + 0.414(s/or) + 0.207 (s/a1)?] 
‘The open-loop transfer function is obtained from Eq. (2-60): 


ats “ (1 +05 ) [1 + 0.414 = + 0.207 (2) -1 


0.914% +4 0.414 (2)' + 0.1035 (2): 
Wi Wy) Qi 


Gols) = 1.09501 
s{1 + 0.453(s/w1) + 0.11(s/o1)?] 
‘The required compensating network is obtained as the ratio Go(s)/G(s): 


G (s) ‘a K,{1 = 0.4(s/w1) + (s/w1)?*] 
4 1 + 0.453(s/w1) + 0.11(s/w1)? 


Oa 1.096 = 0.859 in. /radian 


‘The above represents a resonant lead network, and Ka, is the cascaded d-c gain of the 
e/rorsensing element, the amplifier, the resonant lead network, and the torque motor. 


‘To compare the relative merits of the optimum system and a con- 
yentionally designed system, both systems were set up on an analog 


"i 
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computer and tested with step inputs. The latter is designed on the 
basis of a pair of control poles at 1.460, /+60° and a third pole at 14.601. 
These closed-loop poles are so selected that the integral-square error of 
the system is the same as that of the optimum system. G- of the con- 
ventionally designed system is 
Gin = 1.04{1 + 0.4(s/w1) + (s/o1)"] 
[1 + 0.615(s/1)][1 + 0. 069(s/e1)] 

Figure 2-7 shows the location of nonlinearity in the two systems. 
Figure 2-8 gives computer graphs corresponding to low inputs. The 
system input function is at the top, the system output is located in the 
middle, and the valve stroke is shown at the bottom. We see that, 
although the step-input responses of the two systems are comparable, 
the stroke is much lower for the optimum system. However, this is 


Frc. 2-7. Block diagrams of the two systems showing locations of nonlinearity. 
(S. S. L. Chang, Root Square Locus Plot—A Geometrical Method for Synthesis of Opti- 


mum Servo Systems, IRE Conv. Record, 1958.) 
Poe 
Co ae 


= ae 
eau 


ool EEE et 
Fh He eueaaee 
onl or 
eH eitttinee 


aS a seaetectget 


caf = HH Hint it 
02} 
EEE seit see K . 


ta Senees! eect jan 


(a) (b) 


Fia. 2-8. Step-input response for small inputs. (a) Optimum system; (b) conven- 
tional system. (S. S. L. Chang, Root Square Locus Plot——A Geometrical Method for 


Synthesis of Optimum Servo Systems, [RE Conv. Record, 1958.) 
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not reflected in the system performance, as neither system is saturated 
Vigure 2-9 illustrates the situation with large inputs. Although the 
inputs to both systems are identical, they exhibit different de sf f 
saturation because of the difference in design. These tests cle " i di 
cate the advantage of an optimized system. ech 


PEE 


Bay 


Tenmneeue peaseanaazaa : 


a/x b 
: La AEA 
Tha Saturation ea ape HEE HE Hoo it 


(a) = 


Vig, 2-9. Step-input res i 

ponse for large inputs. (a) Opti 
: Bs . ptimum system; - 
“ poe system. (S. S. L. Chang, Root Square Locus Plot—A TIE M 2 fo 
Synthesis of Optimum Servo Systems, IRE Conv. Record, 1958.) ae 


oe Asymptotic Limit at Large Gain. At this point, we are in a better 
position to investigate the general transient behavior of an optimum 
nyat “464 that is designed on the basis of minimizing the integral- (or mean-) 
aquare error with constraint. Obviously, for very low system gains the 
od is ng nee of the controlled system than of the design 
procedure. e shall study the transient r i 
ellen esponse of an optimum system 

Assuming Eq. (2-53) to be the transfer function of the plant, we deter- 
iiine the poles of the optimum system by the rocksanensloes plot 


ith reference to Fig. 2-10, in the asymptotic limit, m of the poles Q: 
vu, » » » » Qn approach the m zeros Z;, Zz, . .., Zm. The renugiatie 
4 i poles Qmnats Qmi2, ... , Q, branch out along asymptotic lines 
whi Lethe ad he — 1)/(n — m)|zx with the real axis, 1 = 1,2 
“ ~ m. In the limit, these n — m pol idi ¥ per 
hs saves ne taes poles are equidistant from the origin. 

c 1— (/)] TT 

= (8) (T See! te 

h Hi = w/q)| AA = re Pah 


j= 


— _ A~—~) 
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As Q; approaches Z;, 4; approaches 2;, if all the zeros are in the LHP (i.e., 
the fixed component is of the minimum-phase type). The factors in 
the first brackets become dipoles at the various zeros and do not have 
much effect on the transient performance. The balance of the poles are 
equally spaced along a semicircle of radius q in the LHP, as shown in Fig. 


s plane 


Real axis 


xD 


(a) 


Fra. 2-10. Root-square loci of a typical system. (a) Pole-zero configuration of an 
uncompensated system; (b) root-square-loci plot of the optimum system. 


2-11. These poles determine the response of the system. They are 
listed below: 


n—-m=l1: aU 

n—-m=2: — gett! 
n—m=3: —q —ger"* 
n-m= — getit!8, — gexisl8* 


Correspondingly, the system response following a unit step input is 
n-m=l1: CD ee 
1 — 1.414¢70-7°7# sin (0.7070 + ‘) 


n—m = 2: c(t) = 
n—-m=3: chal — et — 1.155¢7°-5# sin 0.866q¢ 
n—-m=4: c(t) = 1 — 2.414¢e-- 9% sin (0.383qt + 45°) 


+ 0-884 gin (0.924qt + 135°) 
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The above functions are plotted in Fig. 2-12. We see that in all 
cases the overshoots are low and the oscillations are heavily damped i 

In circuit theory, a network Roe 
function of this type is called a 
Butterworth function. Its steady- 3 
slate response decreases monoto- 
ously with frequency. To see 
this, one can readily derive from 
Inq. (2-61) that in the high-gain 
limit (n’ = n — m) 


j axis 


s plane 


Fah hauiiadl eeliseead 2 
Lis wi 

‘Therefore FPR Real axis 
Paes Cn, CaS 


While the relation between peak 
overshoot and maximum modulus 
is inexact, it is still quite safe to say 
(hat a system with a steadily de- 
creasing frequency-response char 
ae - Fie. 2-11. Pole- i i 

acteristic does not have much i rnens wih wey hae ee 
aber ch of optimum system with very high gain. 

5 I has gives an indication of the degree of approximation of 
My. (2-52). e product gt, is a function of n — m or the number of 


Mia, 2-12 ‘pei i 
0, 2-12, Step-input response of optimum system in the high-gain limit 


————O————— “~~! 
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isolated poles at the nearest distance: 


Me ete ht qtp = 3.00 
pene eae Os qty = 2.96 
Mie ae ee 33 qty = 3.51 
eae tp = 4.16 


We note that if the number of isolated poles at approximately the nearest 
distance is 1, 2, or 3 the approximation is quite good. 

2-9. Total Design. Sometimes it is worthwhile to determine the 
functional relationship between the performance parameters and the 
parameters of the plant, including the saturation limits, etc. For 
instance, we may wish to investigate the functional relationship between 
the response time of a controlled vessel and the size of the control sur- 
face or to determine the relative merits of two interchangeable means of 
control, e.g., a jet versus a fin, or a hydraulic motor versus an electrical 
motor, etc. Obviously, the system response depends not only on the 
power element but also on the controller. To make the investigation 
meaningful, a reasonably good controller has to be designed or tailored 
to fit each size and type of power element. This is usually a time-con- 
suming process. However, by using the method developed in the present 
chapter we can save time in making a large number of good designs. In 
cases where the criteria of performance and available control effort are 
exactly in the form of integral-square values, the functional relationships 
so derived are for controllers of the best possible design. Even in cases 
where these criteria are only approximate, as discussed in Sec. 2-6, the 
functional relationships are for controllers of uniformly good design, 
inasmuch as the uniformity in goodness can be gauged. 

In a relatively simple case the problem can be solved analytically, 
as will be illustrated presently in an example. In a more involved case, 
we should need a large number of root-square-locus plots. However, the 
effort is still far less than that of making a large number of designs by 
conventional techniques and trying them out on an analog computer. 

To illustrate the method, we shall use the motor-heating problem of 
Example 2-1 but assume a more general form of motor transfer function: 


a“ C(s) Pa Kr 
G(s) = 7s) ~ as + Ms? “- 


The relationship J1, J2, and Kr, 7, and M for an optimum system is to 
be determined. To obtain a solution, Eq. (2-18) gives 
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Sec Sim Ra s GF a oe“ 


OPTIMUM SYSTEM WITH DETERMINISTIC INPUTS 


. é 33 
The function Y(s) can be written as 


Y(s) = (1 + as)(1 + bs) (2-64) 


where +a and +b are roots 

: s of 1/s of the i : 

aid E ; expression on th ne 

side of Eq. (2-63). If kn?/2MKr < 1, the roots are fe oa gre 


a = rei? 


‘A 2-65) 

b = re? 
Leah 2-66 
Tae (2-66) 
Ky (2-67) 

cos 26 = fl 20 is 
IMR, < 90 (2-68) 
If kn?/2MKr > 1, the roots are real: 
= 6” 

: : Si (2-65a) 
(2-66a) 


The ¢ PAs ‘ 
1e constant 7 is given in Eq. (2-67), and the constant 6’ is given by 


cosh 26’ = ck 
OMKz (2-68a) 


The two fo i i 
sae sty, : 
ms are identical if imaginary values of 6 are allowed. Since 


vos 20 is always positive, th i 
, the locatio ithi 
the real axis. Equation (2-44) gives Ponca Pa Go ese 


‘ee 1 
BR) = (Fas + ba) oa 


Assuming R(s) = c/s, w 
e can eval i 
the table of Appendix D. The iatite ee Pani ya at to eae 


on (« +o + = 
z re 5) (2-70) 
i= h(a4— 2%) = SP (eae 
« ¢ a = 3 
2k? a+b) 2Kpr (« a ae ie i) AD 


Mubstituting Eq. (2-69) in Eq. (2-60) gives 


J; 


eee 1 
0(8) (a + b)s + abs? 


Pho gain : i : y y: 
he gain and time constants of the optimally compensated syst: 
em are 


or oe 
a+b caste 
. ab 


“a+b Gi.) 
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Using Eqs. (2-65) to (2-68), Eqs. (2-70) to (2-73) can be expressed in 
terms of dimensionless functions of 6: 


J, = an Cr (2-74) 
ern? r 
Jo = Kem CL (2-75) 
K.= u Cx (2-76) 
T= mcr (2-77) 
1 
om % , 
where Cr (cos 6+ coe (2 cos 26) (2-78) 
1 
= — —% - 
Cx (cos 6 Tis i) (2 cos 26) (2-79) 
_ (2 cos 26)% e 
Cx = “9 cos 0 (2a 
_ (2 cos 26)¥ a 
Cr = “9 cos 0 28h 


Equations (2-78) to (2-81) can be extended in range by substituting 
cosh 0’ and cosh 26’ for cos 8 and cos 26, respectively. The four coef- 
ficients are calculated for various values of @ and @’. In Fig. 2-13, Cz, 
Cx, and Cr are plotted versus Cz. For a specified integral-square error, 
the motor dissipation, gain, and time constants of the compensated open- 
loop transfer function can be read directly from these curves. Alter- 
natively, if the allowable dissipation is known, the minimum integral- 
square error can be read. The gain and time constants to use are the 
ones that correspond to Cz. 

2-10. Summary. In this chapter we have given a solution of the 
problem of minimizing the integral-square error with a specified limit on 
the integral-square value of a certain plant variable. The system con- 
figuration is completely free. The method is then extended to the design 
of systems which meet conventional specifications of response time, SyS- 
tem bandwidth, etc., with least likelihood of saturation. Its application 
is also made easier by deriving a graphical method, the root-square-locus 
method. 

The method is by no means above criticism in its treatment of the 
saturation problem. No calculation of the nonlinear effect is actually 
made. Only the integral-square value of the saturable plant variable is 
minimized under the assumption that the system is linear. Neverthe- 
less, it is an improvement over conventional linear design techniques. 

One difficulty in conventional design techniques is that they leave too 
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the designer. To meet the same performance criterion, a wide 
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St, i - of an inexperienced design is unnecessary sntiration of i 
4 tel wiry eo The root-square-locus method does not 
' » much more effort than Evans’s root- ; 
(loos away with this unwelcome prospect oot-locus method; however, it 
Another advs ed 
mt rt ot est of the method is that it can be used to investigate 
al trends or relationships between the performance of a elect 


i De p 
\y hin dw are requirements T he roblem of integr al-s uare error versus 
motor he ating 18 used to illustx ate the method . 


CHAPTER 3 


STATISTICAL PROPERTIES OF SIGNAL AND NOISE 


3-1. Introduction. This chapter gives the basic material of random- 
signal analysis. It is assumed that the reader has some previous knowl- 
edge of statistics or has read Appendix A. 

We begin by discussing the significance of mean-square error when the 
input signals are Gaussian. The merit of a control system is usually 
judged on the basis of some cost figure which increases with the absolute 
value of the error, but the exact form of the cost-versus-error function 
varies from application to application. It is shown in Sec. 3-2 that, no 
matter what form the cost function takes, a system with a lower value of 
mean-square error has a lower expected cost figure. The same cannot be 
said for systems with non-Gaussian inputs. In Chap. 2, we used the 
integral-square error as a convenient merit measure of a system’s transient 
response to deterministic inputs. Now we see that it is also an absolute 
merit measure of a system’s response to Gaussian random inputs. Since 
the inputs to a system are neither completely deterministic nor Gaussian 
random, the use of mean-square error as & merit measure is more reliable 
than it is in the purely deterministic case, although it is not so absolutely 
reliable as in the Gaussian case. 

Once we have decided upon the mean-square error as a system merit 
measure, the problem is then how to calculate the mean-square error 
with random inputs and how to design a system with least-mean-square 
error. The calculations are much simpler if the system is stationary. 
Section 3-3 points to the difference between stationary and nonstationary 
processes and introduces the ergodic hypothesis which states that, for a 
stationary process, the ensemble average of any quantitative measure is 
equal to its time average. This facilitates the analytical work of the 
remaining sections of the present chapter, which are concerned only with 
stationary systems. In Secs. 3-4 to 3-7 we show how to calculate the 

mean-square value of any system variable from some statistical properties 
of the inputs, known as power spectra and correlation functions, and how 
these properties are defined in terms of measurements made on the signal 
itself. The equivalence of power spectra and correlation functions is 
established by showing that one is the Laplace transform of the other. 
The problem of zero crossing is also discussed, 
uo 
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Piece tiges igh the power spectrum is not measured but estimated 
generating process of the signal. The method of doin is i 
given in Sees. 3-8 to 3-12. ad 
; try Random Signals in a Control System. In contrast to determin- 
* 6 pre 9S pr signals cannot be described as given faneiona of 
ep, a ramp, or a sine wave. The exact function i 
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Tf the fluctuation of a certain system variable x is caused by noise, 
turbulence, or other form of Gaussian random inputs, or if it is due to a 
rapid succession of deterministic inputs such as steps or ramps but the 
individual steps or ramps are applied so fast that the system has little 
time to settle, or if it is due to any combination of inputs and disturbances 
of the above description, p(x) can be closely approximated by a normal 


distribution function: 
(z = 3) 
exp be ae (8-1) 


p(x) = — ; 


In Eq. (3-1), @ is the mean value of x, and o? is the mean value of ( — Z)?. 
In most applications we are interested only in the variations of x instead 
of x itself; furthermore, positive and negative excursions of x from its 
steady state, or quiescent value, are equally likely. If we use x to denote 
the deviation of a system variable from its quiescent value, rather than 
the variable itself, Z is always zero, and o? is simply o? = x, We see 
that, for a system variable the amplitude of which is normally distributed, 
once its mean-square value is determined, its complete distribution func- 
tion is determined. 

If the system variable obeys normal distribution, the least-square 
criterion is the absolute criterion in the following sense: If we are inter- 
ested in keeping the system error within a certain tolerance limit for most 
of the time, the system with the least-mean-square error has the largest 
percentage of time within such a tolerance limit. If we are interested in 
keeping a certain system variable below a certain assigned maximum for 
reason of saturation or safety, the system with the least-mean-square 
value of the variable has the least likelihood of violating this restriction. 
These points can be established by proving the following theorem: 

Theorem. If pi(x) = (1/os)f(x/o1) and p2(x) = (1/o2)f(a/o2), where 
a1 > oz and f(x) > 0, and g(x) is a nondecreasing function of the absolute 
value of x in both positive and negative directions of &, then 


[2 pilo)g@) dx > 7, px@)ala) ae (3-2) 


proor. Let A denote the difference: 


A= : ia pi(x)g(x) dx — ie p2(x)g(x) dx 


“ io a 1(2) rot i A (=) g(x) dx 


Let ¢ denote x/o, in the first integral and » denote v/oy in the second inte= 
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gral; then we have 


A= [~ f(®g(o:8) dt — [*, fn)g(omn) an 
= [~, floors) — (028) at 


The function f(8) cannot be negative and must be positive for some 
value of & Since |o,é| > |oxé|, and g(x) is a nondecreasing function 
of ap g(ois) — g(o2&) is always positive or zero. Therefore A > 0, QED 

Che physical interpretation of the above theorem is as follows: The 
normal distribution function of Eq. (3-1) obviously fits the snails 
ol the theorem with f(£) = (1/+/2m) exp (— £/2). The function (2) 
is usually referred to as a cost function. Its functional form aesenae on 


(he application. For instanc 
; e, g(x) may represent th i 
” represent the error. Then j ; ivae Oa 


g(x) = 0 el ec hary 


at, >O 1S |2| <2 


|| < ty, it is a definite hit. With |z| > xe, it is a definite miss 

_ aes in-between value of x, the probability of a miss increases sith 
| e error. As another example, g(x) may represent motor heating and x 
represent motor current. Then g(x) is equal to a constant times x? 

It should be noted that the theorem holds as long as g(z) increases with 
in increasing magnitude of x in both positive and negative directions of « 
It is not necessary for g(x) to be symmetrical. For instance, in met 1 
working, if the positive tolerance limit is exceeded, the work iec i 
hojocted for rework; if the negative tolerance limit is extended the ic : 
\n rojected, period. The cost is obviously not equal in the two Sane 

Lhei ntegrals on the two sides of the inequality (3-2) represent tpeiied 
souls of the two systems. We may summarize the significance of the 
oo ” f were With a normally distributed system error (or some other 
ions : : 

- “aa ble) the system with the least-mean-square error has the least 
-— ler egiaid and Nonstationary Processes, Ergodic Hypothesis. 

processes encountered in control work can be classified as 
silionary and nonstationary processes. To gain some insight into the 


wihjeet, let us think of the underlying mechanism that generates a random 
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and the outcome of dice throwing is determined by a probability of }@ for 
each face of each die independent of all others. 

If the generating mechanism does not change with time, any measured 
average property of the random process is independent of the time of 
measurement aside from some statistical fluctuations, and the random 
process is called stationary. For instance, ocean-wave height in a given 
sea state, thermal noise, and a telegraphic signal of a certain language are 
examples of stationary random processes. If the generating mechanism 
does change, the random process is called nonstationary, in which case 
the generating mechanism may change in a predetermined fashion or at 
random. An example of the former is found in missile launching where 
the unwanted fluctuations in vertical and lateral thrusts are generated by 
one or a group of rocket motors of which the operating condition is a pre- 
determined function of time. Alternatively, the generating mechanism 
may change at random, as in the course of flight of an aircraft, where the 
wind disturbance depends on the local atmospheric condition which is 
sometimes calm and sometimes turbulent. 

The random processes or events generated by the same underlying 
mechanism form a statistical ensemble. When we say the averaged value 
of a certain quantity, we usually mean the ensemble average, which is the 
averaged value over all possible events weighted according to the proba- 
bilities of such events. However, for a stationary process we find little 
reason to distinguish between the ensemble average at any particular time 
and the time average of a single event. The life history of a single event, 
can be chopped up into an infinite number of events. Since these events 
are generated by the same underlying mechanism which does not change 
with time, they can be thought of as arbitrarily selected from the entire 
ensemble, and therefore they represent the latter. The assumption of 
equivalence between time and ensemble averages is called the ergodic 
hypothesis. As we shall see, using the time average instead of the 
ensemble average, and vice versa, is essential to our analytical work at a 
number of places. 

3-4, Basic Properties of Correlation Functions. The analysis of Sec. 
3-2 points to the logic or desirability of the least-square criterion for 
systems with random inputs. We shall proceed to derive the basic 
mathematics required in such an optimization procedure. Without loss 
of generality, we define the random signals as variations from their mean 
values, and therefore they have zero mean themselves. 

The statistical properties of a set of random signals (which may include 
the random inputs and system variables in response to these inputs) can 
be described by their correlation functions. The correlation function 
dap (tita) is defined as the ensemble average of signal a(t) at time ¢, and 
b(t) at time ty. 
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$av(tiytz) = (a(t)b(t2)) (3-3) 
When the two si ; 
gnals a(¢) and b(t) are diff 
cross- ; . ferent, Pav(ti,t2) 1 
: correlation function. When a(t) and b(é) om a 2) : called the 
“ ig is called the autocorrelation function e and the same, 
fe ; ‘ 
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Therefore the only signi i 
y significant independent i i 
shall denote as7. Instead of Eq. (3-3) we file tone i, potted 


gar(r) = a(t)b(t + 7) (3-4) 
The time interval 7 is call 
| ed the correlation ti 

properties of the correlation function follow directly atu Go 


baa(0) = [ay 


dav(—T) = doa(7) oe 
hese “on 
ab\T < $aa(0) $y4(0) 
|baa(7)| <= $aa(0) . arn 
Inequality (3-7) can be proved as follows: 
ja(t)]? [b(t + 7)]? 2la(t)b 
T Ps (t+ 
Pee , | dv (0) On 
- 2 ee Ws ep 
| V Goa(0)| | -Vd0(0) | ae 
Averaging the above inequality gives 
2| bav(7)| 0 
V daa(0) (0) ~ ri! 
Hi can be rewritten into inequality (3-7). 
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the magnitude of the cross-correlation function cannot exceed the product 
of the rms values of the two signals. 


Example 8-1. Letv = di/dt, where i(¢) is @ stationary random signal. Show that 


giv(0) = 0. 
Solution. By definition 


4000) = Ci GP 


4 Gap = 2100 GP 


since 

Taking the ensemble average of both sides, we obtain 
4 4.0) = 2oie(0) 
dt we — we 


The left-hand side of the above equation is obviously zero. 


3-5. Spectral Densities and Wiener’s Theorem. Quite similarly to 


the treatment of deterministic signals, we find it mathematically expedi- 


ent to transform the random variables into the frequency domain. Let 
the chopped signals ar (t) and br(t) be defined as the same random signals 
a(t) and b(4), respectively, in the interval 0 < t < T but zero outside this 
interval, and A7(s), Br(s) represent their Laplace transforms. Then 


L [Panne +n a= 7 [F ertome +n a 
0 0 
5 i * ar(t) | * Balser? ds dt 


~ OnjT 
1_ f?* Ag(—s)Br(s)e" d 
= OT ed 7(—s)Br(s)e" ds 
Performing the ensemble average over the above equation, we have 
T—?T 1 Lane E i 
a ee da(T) = oa Nie P (Ar(—s)Br(s))e ds (3-9) 


Let a function &a,(s) be defined as 


ba(s) = jim 7 (An(—8)Br($)) (3-10) 


As T approaches infinity, Eq. (3-9) becomes 


ee ie 
ar(7) = Oj ite Bap (s)e* ds (3-11) 
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in th : é 
a oe cracls Equation (3-11) shows that ¢a(r) is the inverse 
ransform of @.(s). It follows that ®.(s) is the Laplace trans- 


form of ¢u(r). Since, i 
pide 2 , in general, ¢a(—7) # 0, the Laplace transform is 


Bas(s) = ["_ das(r)e-* dr (3-12) 
T A ; 
he transform relationship between correlation functions and spectral 


densities as represented by the pai 
Wiener’s theorem. y the pair of Eqs. (3-11) and (3-12) is called 


b, (t) b, (t) 


Input 
signals 


av) b(t) 


Ita, 8-1. When the s iti i 
3- pectral densities of y 22) gi 
densities of the output signals a(¢) and roe ee eee 


|. Symmetry Properties. Equation (3-10) implies that 


Pav( jw) = Bya(—jw) 3-1 
Daal Jo) ae! Baa(—Jw) > 0 pee 


RR se and (3-13a) show that ®a,(jw) is the complex conjugate 
. ai and that Peal je) is a real positive even function of w. 

Tadpole eel Properties. In Fig. 3-1, a(é) is the sum of a number of 

s - i(t), ge eens an(t) through filters with transfer functions 
wl), ... , Han(s), respectively. Similarly, b(é) is the sum of a num- 


ae af ot 
wr of signals b\(), bo(t), . . . , b»(t) through filters with transfer func- 


The function ®a»(s) is called the spectral-density function when a and b 
are the same, or the cross-spectral-density funetion when a(é) and b(t) 
represent different signals. Its physical significance will be made clear 
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$e dQ os yb S12 mM We shall show that 
n m 
da(jo) = YY Ha — jo) Hn, (je) Pain (Je) (3-14) 
i=1k=1 


Equation (3-14) can be proved as follows: Let the chopped signals be 

defined as 

ari(t) = a;(t) 

One or 

bri(t) = b;(t) ao ae 

ari(t) = bri(t) == ii) <0 i> Mb 
Let ar(t) and br(t) represent the sum signals when the inputs are replaced 
by the respective chopped inputs. Then 


An(—jea) = J) Hoi(—ie) Ande) (3-15) 
Br( js) = ¥) Hoa(ju)Boa(is) (3-16) 
=1 


Multiplying Eqs. (3-15) and (3-16) and dividing the result by T, 
i=nk=m 
Yea) Hon js) An =e) Bra) 


i=1k=1 


4 An(—jw)Br(jo) = 
If we take the ensemble average over the above equations and let fi 
approach infinity, this becomes Eq. (3-14). 

We note that ar(t) and br(t) as defined here are the responses to chopped 
inputs and are not exactly the same as the chopped response signals which 
should be used in Eq. (3-10). However, the differences are appreciable 
only at the two ends of the 0 to T interval and become insignificant when 
we let 7 approach infinity. 

Two interesting special cases of Eq. (3-14) are the following: 

FILTERING PROPERTY. If a signal a(t) is passed through a filter with 
the transfer function H(s), anda signal e(t) is obtained at the output, then 


B.o(jw) = H(jw)H(—jo) Pi(Jo) (3-17) 


Equation (3-17) is obtained by letting ai(t) = bi(t) = a(t), Hails) = Hyi(s), 
andn = m= 1. 

ADDITION PROPERTY. Let all the H’s equal unity, a(t) = 6:6), and 
m = n; then Eq. (3-14) becomes 


n n 


SY Paa(ie) 


t=1Lk=1 


n mn i-1 
YS baa(io) + YY (Peels) + Paya(Jo)] (3-18) 
i=1 im2kel 


Pac (Jw) 
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: he spectral density of the sum signal is equal to the sum of the spectral 
densities of the individual signals, if these signals are uncorrelated; other- 


wise, the real components of the cross-spe tral densi 1e8 ve rise to n 
Cc 
: £ a 


Example 3-2. For the problem of Sec. 2-2, ©,,(jw) is gi 
: : . 2-2, ®,,(jw) is given instead of i 
integral expres f a “74) Ct ey ame 
art oe sions for the mean-square values of e(¢) and z(t) in terms of the control 
Solution. Since 
E(s) = R(s) — C(s) = [1 — F(s)]R(s) 
I(s) = C(s) = F(s)R(s) 
G(s) G(s) 


liq. (3-18) gives 
Be(jeo) = [1 — FGu)][. — F (jue) (ju) 
Bui ju) = EGO) (—Jo) Ser (jes) 
@Ga)G (— ju) 


Ity letting a(t) = b(t) = e(t), andr = 0 in Eq. (3-11), we obtain 


EOF = 4e(0) = as [7 deals) ds 


1 je 2 
= Sas je [1 — F(s)][1 — F(s)]#,r(s) ds (3-19) 
Mimilarly, c@)? = rs fi a rereere ds (3-20) 
TT: —jo AB = 


Example 3-3. A signal z(¢) consists of a random series of impulses: 


a(t) = y T,5(t — tn) 


n=—0 


where ¢, i 
' t, occurs at random but on an average there are v impulses per second, and the 
iienn-square value of J, is 12, Determine ®;;(jw) : 


Solution. Consider a large interval T’ and chopped signal ir(t): 


; vy 
Ip(jw) = . i(t)e-#t dt = > Tye itn 


; O<tn<T 
@i:(jo) = 7 > Te Taei@te-ts) > 
OS tritee 
We note that, as ¢, and tm occur at random, (e/n—*m)) = Q unless tr = t Therefore 
: 1 
Bi; ati Ss wie 
(jw) 7< > I, ? = rf? (3-21) 
O<ta <T 
Nxample 3-4. A Gaussian signal 7(¢) with zero mean has a spectral density 
, 10 
(jo) = =— 
(Gu) = bat FO 
Hetormine the fraction of time in whieh |i(@)| > 5. 
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Solution. ‘The mean-square value of i(t) is determined first: 


ee iv 10 
(8) = Tbr +O GF +8 +3)(s? —8 +3) 
s+ fe pen) Ge ae 
Qj Oe = se7 ue 8-9 


B= 2 
When 7? is known, the probability for \i| > 5 is obtained by integrating the density 
function: 


2 eee 2 [* 
oui > 5) = —— i e-(lte®) dj = e-** dr 
V2raJ5 Vx J 5/26 


5 
=1-® = =1 — (2.74 = 1 — 0.99989 = 1.1 x 10-* 
(Ss a ( ) 


The function &(z) is called the error integral. 


3-6. Physical Significance of Spectral Densities. In the preceding 
section, the spectral density is defined mathematically and used in caleu- 
lating the mean-square value of the output variable of a system with 
random inputs. However, it has a physical significance of its own which 
will be shown presently. 

In case a of Fig. 3-2, we have a narrow bandpass filter having a transfer 
function H(jw) such that |H(jw)|? is unity at wo and has an effective 
bandwidth Aw. The signal 7(f) is applied to the input of the filter, and its 
output a(t) is squared and averaged by a low-pass filter which has unity 
gain for d-c signals: 


1 bs : , ‘ 
[a(h]? = Dal fr H (jw) H (—jo) ®ii(jo) dw 
The phase angles of H (jw) and H(—jw) cancel out. If Aw is small enough 
so that ,(jw) does not change much in the passband, 


[atl = 2° w(jove) = 2 AfBa joo) (3-22) 


The factor 2 is caused by integrating over both the positive and negative 
ends of w. 

We may think of 7(¢) as an electric current passing through a load of 
1-ohm resistance. Then [7(t)]? is the total power, and [a(t)]* is the portion 
of power within the passband of H (jw). While from an engineering point 
of view we regard the bandwidth of H(jw) as Af, mathematically it is 
2 Af, because of the presence of a passband in the negative frequency 
range. Therefore, Pi(jwo) is the power content of 7(t) per unit bandwidth 
at w = wo. It is also called the power-density spectrum. 

In some textbooks Af is regarded as the bandwidth and the spectral 
density is twice the value that we use here. However, the present usage 
is becoming more common in the literature. 
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Similarly, for case b of Fig. 3-2 


GOOD = 5° [ial jee) + Biol —jeo)] 


= 2 Af X real component of ®;, (ja) (3-23) 


he real component of ©;,(jw») is called the cospectrum. 


the power of 2(é)v(¢) per unit bandwidth at wo. ee 


Similarly, for case c of 


i(t) a(t 
7 a? | Low-pass shh 
filter 
(a) 
Low-pass 


a(t) 


i(t) 


Multiplier 


i « 

— a(t) 
a 3 
Bo nd Os 5 See 
Low-pass SO Es 
filter 


Multiplier 


(jw) H| —jw) 
1 


(c) 
Aw 


w 
&o 


Nia, 3-2. (a) M asurement y 
$-2 e. m 8 of iS} 
_ . pectral densit ) (6 and c) real and imaginary com- 


Vig. 3-2, in which Z(jwo) undergoes an additional phase shift of 90° 


ab Af 
- (dD, — B,;) = i i 
; ( vi) = 2 Af X imaginary component of 8;(jwo) (3-24) 


wo 
The j 4 : ; 
© Imaginary component of ®;,(jwo) represents the reactive power of 


MMe) per unit bandwidth at i 
RG at wo. It is also called the quadrature 


ay she Zero-crossing Problem. In some applications, we are not so 
ch interested in the fraction of time for which 7(1) exceeds a threshold 
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level h as in the average number of times 7(¢) bing in h pier 

in Fi leulated as follows: Let »v = 
illustrated in Fig. 3-3. It can be calc i 
oT nti v) represent the joint probability density of i and v. The time 
that it takes i(t) to change from h to h + dh, or vice versa, 1s 


dh 


T=T7 


lv 


Let aN represent the number of crossings per unit time W ith di/dt in the 
g 
range v to v dv. Then T aN 18 the fraction of time for di/dt to be in 


Fig. 3-3. Crossing of level h by a random signal. 


the range v to v + dv and i to be in the range h to h + dh. But by defini- 


tion of the joint probability density p(i,v), this is p(h,v) dh dv. Therefore 
dN = 1 pth) dh dv = p(h,o)|o| dv (3-25) 

{3 
N(h) = [~, lvlp(hy) ao (3-26) 


For Gaussian signals, Eq. (3-26) can be expressed in a = ee 
correlation functions. Since ¢i(0) = 0 (Example 3-1), p(?, 


written as exp {—146[i?/oi(0) + 0?/bo0(0)]} 
piv) = Qe ~/ bii(O) bov(O) 


Substituting the above expression for p(hv) in Eq. (3-26) and integrating 


give 1[ ¢.»(0) \ —h2/ 26450) 
Se) =A é ne 
Nims $is(0) 


ll 


where N(0) is the number of zero crossings per second: 


wv 


1 | deo(0) | 3-28 
wo) = F [$50 | = 


In Eq. (8-27), N(A) is the number of times i(t) crosses the level A in both 


N (O)e-¥/ 268 (3-27) 
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directions. Therefore, N(h)/2 is the number of times a(t) goes over h, 
and N(h) is the number of times |i()| goes over h. 


Example 3-5. For the signal i(¢) of Example 3-4, calculate: 
(a) The number of zero crossings per second. 

(6) The number of times per second that |Z(t)| exceeds 5. 
Solution. Since V(s) = sI (s), Eq. (3-17) gives 


—10s? 
Boole) = be +9 
1 jo —10s? 10 
$o0(0) = 5 hl +5 +9" ~ox1x1 75 
As calculated in Example 3-4, $::(0) = 54. Equation (3-28) gives 
3 
N() = Sf = 0.551 sec™! 


quation (3-27) gives 
2 
N(5) = NO) exp (- Ea) = N(0)e-™® = 3.1 X 10-4 see71 


3-8. Generalized Shot Effect. There are two types of random inputs, 
(he ones that we can only measure and the ones that we define by a set, 
of probabilities or conditional probabilities. An ensemble of the former 


Cathode i,(t) 
Plate 
\ 
Electron 7) 
(b) 


oo--- 


(a) 


Nia, 3-4, Shot-effect current pulses of (b) are due to individual electrons in transit, 
fw shown in (a). 


Honsists of measured records of a random variable under well-defined 
physical conditions, as well as the infinite number that are not measured 
lit probably exist under the same set of conditions. One example is 
\he pressure record of atmospheric turbulence. The spectral-density 
function is found to depend essentially on two parameters, rms gust 
velocity and mean length of turbulence. These two parameters are then 
twed to define the ensemble. Alternatively, the condition that defines 
the ensemble of records of internal noise of a radio receiver is simply the 
fecoiver itself. The analytical problem here is how accurately we can 
Hlelormine the ensemble-average power spectrum from the limited num- 
her of sample records that we measure and analyze. This problem is 
faken up in Chap. 7. 


--— 


trons emitted is very unlikely to be exactly TI,/e. 
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als defined by a set of probabilities, the infinite collec- 
tion of sequences of outcomes that obey the set of probabilities constitutes 
an ensemble, and each sequence is a member of the ensemble. In the 
subsequent sections, we shall develop methods for calculating the ensem- 
ble-average spectral densities directly from the set of probabilities. 

One special class of probabilistic ensembles is of utmost interest in con- 
trol work. It is referred to in the literature as generalized shot effect. 
We shall use the shot effect to illustrate its significance. In Fig. 3-4a, a 
plate current 7 is induced by discrete electrons traveling from cathode to 
the plate. If it is assumed that the space-charge effect, or the interaction 
between electrons in transit, can be neglected, the induced burst of cur- 
rent of each individual electron in transit is then independent of that of 
all others, as illustrated in Fig. 3-4b, Let f(t) represent the induced 
current of an electron emitted at t =.0. The total plate current is 


ip) = Sift — m0) (3-29) 
l 


For random sign 


Since the emissions occur at random, the exact time of emission, 71, of 
each individual electron is not known. However, the average number of 
rs per second, which is the same as the average number of electrons 
emitted per second, is known as [ ,/e, where J, is the average plate current 
and ¢ is the electron charge. Thus i,(t) is defined by the function f(é), 
the randomness of 7, and the average number of 7’s per second. It 
should be noted that, in any given period of time 7, the number of elec- 
It usually differs 
from this expected number by a small margin. 
Equation (3-29) can be generalized as follows: 


N cy 
a(t) = , es uafa(t — TH) (3-30) 


where both a, and 7, are random values with known probabilities of 
amplitude distribution and of occurrence, respectively, and f,(t) = 0 for 
all t <0. Equation (3-30) represents a composite series of recurring 
events the nature of which is represented by the functions fx(¢). In 
some cases, there are fixed instants of occurrence. However, in most 
cases we know only the average number of occurrences per second but not 
the exact instant of occurrence. Furthermore, we know beforehand only 
the probability function for various amplitudes a, but not the exact 
amplitude. Sometimes, the amplitude distribution depends on previous 
amplitudes. For instance, the transverse position of a zigzagging air- 
craft is usually considered a random series of ramp inputs, while a dial 
position is usually considered a random series of step inputs. However, 
there is a fundamental difference in the randomness of the two examples, 
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In t i 

: : ve gory i Fav eae of aircraft velocity in the transverse direc- 

weit ; . ina teen 8 each subsequent input depends on the 

ae . the speed is already at its maximum, the next input 

ee iryvigid direction. In the latter example, the dial 

ey ae y number of turns in one direction, and each in d 
pletely independent. dae 

To calculate its s ity, ¢(t) i 
Whig de A density, z(t) is broken down into a number of 


) 


a(t) = » Gef(t — Tin) (3-31) 


l=—0 


n 


Then i) = Y a) (3-32) 


k=1 


I nh man y applications the v7 t S are indeper ident OF | 10 } 1e! 
| ‘ 3 18 ® ; 
: ( ) ’ AC ) ‘ f ac t ty and 


#;;( 1) = , int, lw 
jw) 2, Pini, (Jw) (3-33) 


Where (jo) and S4,(5 

me ye ; 

—ailie» vix(Jo) are spectral densities of i(¢) and a(t), 
lo determine ®;,;,(Jw), we shall de- 

fine a unit step series: 


u,(t) 


un(0) * DY auu(t — ru) (8-34) 
co Fra. 3-5. Relation between a random 
series of known time functions and 


where u(é) is th i i 
(t) e unit step function. corresponding series of step functions 


4 t apply uz(t) at the input end of 
i filter with transfer function sF’,( 

i : ”.(s), where F;,(s) i 
ul f(t), %(t) is obtained as the eubpat (Fi ott i ce eee 
ef g. . From Eq. (3-17) we 


3 Pix (Jo) = oF (Jo) F x(—Jw) Bvyu,( jo) (3-35) 
NWquations (3-33) and (3-35) can be combined as 


Pii(Jw) = w? 2 Fi(Jw)Fe( —Jo) Puyu, (Jo) (3-36) 


a ‘ 

. oe Se we shall study the statistical properties of a random 

_ step inputs. The spectral densities of the more general cl 

A = inputs are then calculated from Eq. (3-36) i Rie 

— gl, stomp of Recurring Step Function with Fixed Occur- 
andom Amplitudes. Let ¥(¢) be a random function of time 


fe illustrated in Pig. 3-6 
intrated in Fig. 3-6. The value of y(é) is a constant Yn in the interval 


We 1)7' we i 
<t<nT,. We may consider y(t) as a series of step inputs 


YY F 
52 SYNTHESIS OF OPTIMUM CONTROL SYSTEMS STATISTICAL PROPERTIES OF SIGNAL AND NOISE 53 


peiriog atid», 1, 2B on CyMmbys s 
Ce ty... eae 
An = Ynti — Yn (3-37) 


. , with amplitudes... , 


In some applications, where the input function can be represented by 
a random series of step or ramp inputs, y, has direct physical significance, 
since it represents the position or speed. In other applications, where the 
waveform factor f,(t) of Eq. (3-30) is, for instance, an exponentially 


C, 


Fie. 3-7. Normalized autocorrelation function. 


The spectral density ®,,(jw) can be calculated from Kq. (8-12). Since 
py, (7) = duy(—7), 


Fic. 3-6. Recurring step inputs with fixed occurrences and random amplitudes. 


Py, (Jw) — | °* ew(r) —jor I 
nthe See digi Pg Tea 


TY ? q 
= [ pS eet (2 = ) c+ To: | eer" dr! 


by(0) = Ya? (3-38) + complex conjugate 


decaying function, y, of the corresponding unit step series would lose 
much of its meaning, since it is no longer a measurable, physical entity. 
The most significant quantities then are the input amplitudes an. 

For a stationary time series, yn? is obviously independent of n. We 
have 


and YnYn41 depends on lonly. Let the correlation coefficient c be define 2 > | (- J erie of bible, a Sickert 
as Fy w wT wT : 
YnY n+l 
em See 3-39 
* b(0) 


o 

+ J g-iw4yr een gy ee ee 
ae + ope apo | Coes 
=0 


+ complex conjugate 


Re cae ee | 
.: [2+ apa - om 


«o 
1 , 
+ », 5 Qe-iwlT — e-so4)T _ ewi-or|} C1 
l=1 


+ complex conjugate 


Once the sequence of c’s is known, the autocorrelation function an 
spectral density of y(t) can be calculated as follows: If r = IT, obvious, 
y(t)y(t oe T) = YnYntl. : 

In general, IT <7 < (1+ 1)T. Let r= IT +7’. With referen 
to Fig. 3-6, when fis in the range nTJ' to (n + 1)T — 7’, the ensemble ave 
age of y(y(t +7) is YnYn4z When ¢ is in the range (n + 1)T — 7’ 
(n + 1)T, the ensemble average of y(t)y(t + 7) 18 YnYn4i41- Consideri 
all values of t, we have 

A 


ba) = TONE A) = (1) tad + Pte 


by, (0) (: a 7) a pou | 


A plot of y,(r)/dy,(0) is shown in Fig. 3-7. 


oO 


4 sin? (wT'/2 
(3-40 = aon (er) (1 +2 ». Cy COS yD) (3-41) 


l=1 


ll 


Pp 

Exam le 3-6. Let us cons der A8 an ex imple C t where bisa real const 
consi san é U Un) 

tho Hhaolute value less than unity, 
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142 i? b! cos lwT’ = 1+ 3 bi(eileT + ei?) 
t=1 t=1 


beiet bene? 
sity oye tT —ter® 
La 2 
met (3-42) 


~ 7 — 26 cos wT + b? 
Equations (3-41) and (3-42) give 


Oppo ea b?) sin? (w7'/2) 
dyy(0) 2 7'(1 — 2b cos wT + 6°) (3-43) 


Equation (3-43) is illustrated in Fig. 3-8 with values of b equal to 0.5 and —0.5, 
respectively. 


$,,(jo) | | 

2T ¢5y(0) Is é 
lc | 
oO 


—_— 


1.0 
0.8 
OSes SN ie yD | LP 
O4EE ee NUE, Ci (titi, ere 
0.2 
0 Tv ar 4n 5a 67 
of 


Fig. 3-8. Normalized spectral density. 


3-10. Calculation of the Correlation Coefficients from Conditional 
Probabilities of Input Amplitude. In general, the correlation coefficients 
c¢; are not known, and the distribution function of a, is given instead, 
The distribution of a, depends on the present state of the system, which 
is simply y». Before the time n7’, we do not know what a, will be. We 
do know, however, that the probability of its being within the infinitesi- 
mal range a, + 144d, is P(dn,Yn)5an. Let us assume further that the 
probability density funetion P(an,¥n) depends on the absolute value 
lan + (1 — b)y,| only. If b = 1, the input amplitude a, is completely 
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independent of the past. If b = 0, the instantaneous amplitude y,41 is 
completely independent of the past. i 
To start with, let us compute the average value of YnYny. This can 
be done by mathematical induction. For any given value of yp, there is a 
set of average values Yn+1, Yng2, -- +, Yngt, .. » Which decane on ¥ 
We use the wavy line to indicate that this is a conditional average taken 
under the condition that y, is known. If we form the product yn¥n41 and 
make a weighted average over all possible values of yn, YnYJn41 Would be 
obtained. When the value of y,4:-1 is known, the average value of yn41 is 


(Yntt)av a fie (Qn41—1 as Yn4t—-1) P(An41—-1,Yn41-1) ddn41-1 (3-44) 


Let us introduce a new variable £; & = a@n4i-1 + (1 — b)yn4i-1. Since 
Yn4i—1 1s fixed, dan4i1 = dé, and Eq. (3-44) can be written as 


(Yn4t)av = pe (é + bYn4t—1) p(£) dé 
o / “P(E dE + byngia | * p(é) dé (3-45) 


Since p() depends only on the absolute value of £, the first integral on 
(he right-hand side of Eq. (3-45) vanishes. The total probability of the 
input @n4:1 having whatever possible value is unity; consequently, the 
second integral is unity. Therefore, 


(Yn-+1)av <3 bYn41—1 (3-46) 


(Ynttav has been taken over a fixed value of yn4i1. If we make a 
weighted average on the two sides of Eq. (3-46) over all possible yn41-1 
under the condition that y, is known, it becomes 


Gust = OYnsi-r (3-47) 


‘The wavy line above denotes conditional average. Repeating Eq. (3-47), 
we get 


Gut = b'Yyn 
= YnYnu = dy? (3-48) 


y , ret . 
| 0 evaluate Yn, we note that, for any given y,, the average value of 
Whar is 


YrnYn4t 


vir = [7 Gun + n)*plan + (1 = bya] dan 
= [7 (& + byn)*p(e) ag 
j= A inf (£? + 2bEyn + b2yn”)p(é) dé 
= [> Bp 8) dé + d%yn? (3-49) 
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Averaging Eq. (3-49) over yn, we have 
Tea = f°, Pps) de + bya! (3-50) 


Since the average y,2 should be the same for all in a stationary random 
process, Eq. (3-50) can be solved for y,?. It is 


_ fl: ep@ae 
dyy(0) a Ga he (3-51) 


Yn is dy,(0) by definition. From Eqs. (3-43), (38-48), and (3-51), the 
spectral density ®,,(jw) can be calculated. While we have written 
+ for all the limits of integration, this does not rule out the case of 
limited amplitude. For the latter case, we may simply define p(f) = 0 
for |t| > &m. Once yn reaches &m/(1 — b), dn can be negative. The 
upper limit of y, is therefore ¢m/(1 — 6). It does indicate, however, that 
if y, is limited, b ¥ 1, and the input amplitude a, cannot be independent 
of Yn. 

If b = 1, the autocorrelation function ¢,,(z) is infinity, but the spectral 
density of Eq. (38-43) becomes 


We be P 
bn(is) = ap |”, PP a = Sp (3-52) 


These results are to be expected. With completely independent input, 
the expected value of [y(t)]? is the mean-square deviation times the num- 
ber of inputs (see Sec. A-4). Its expected stationary value is infinity. 
However, its a-c power is limited. Each step input contributes an 
amount 2£/w. Since each input is independent of the other, the total 
power adds to the value given in Eq. (3-52). 

The normalized autocorrelation function is unity. This corresponds 
to the fact that, with an existing amplitude approaching infinity, the new 
inputs cannot make much difference one way or the other. 

3-11. Spectral Density of Recurring Step Function with Random 
Occurrences and Random Amplitudes. In a majority of applications, 
the exact moments of the inputs are not known. However, we know that, 
on the average, there are vy inputs per second. The statistics of the ampli- 
tudes are the same as those given in Sec. 3-9. In mathematical language, 


yd) = > aud — Tr) (3-53) 
where +++ Ty < Ty) <7; < Tz - + + and u(t) is the unit step function 
(see Fig. 3-9). 

lim a v for all n (3-54) 


low ne are Ze 
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We shall follow the same procedure as before, namely, determine the 
autocorrelation function first and then determine the epacteal density in 
terms of the autocorrelation function. To begin with, we shall calculate 
the probability p-(l) that, within a period r, J inputs have occurred. Let 
us divide T into small intervals 67. The probability of two ooourreneee 
in any single interval is (v ér)*. The total probability of having such 
double occurrences is of the order of (r/5r)(v 5r)? = vr br. Therefore 
by making ér sufficiently small, the probability of multiple occurrent 
in any single interval can be neglected. 


Mia. 3-9. Recurring step inputs with random occurrences and random amplitudes. 


Let N denote the total number of intervals, 7/ér. The probability 
of having 1 single occurrences is 


N! 
p,(L) = (WW — Di (1 wmik s ér)4-4 (v 6r)! 


total number nothing occurred single occurrences 
of ways in N —lintervals in 1 intervals 


‘The above expression can be rewritten in the following form: 
1-1 

v |] [oN — 2%] 
h=0 (1 — v 67) A/>8r) oN Gr—vi6r) 


l! 


gris! 
=F | I] @ -—& in| [(1 — v dr) M-r]r—wlir (3.55) 


k=0 
In the limit of 67 approaching zero, it becomes 


pr)? 
p-(l) = ( a el (3-56) 


Nquation (3-56) gives the probability of having exactly J occurrences in an 
interval r. It is known as Poisson’s distribution. 
lhe autocorrelation function is the weighted average of Ynijn4t: 


dyy(r) — », Pr DY nY nse (3-57) 
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Substituting the result of Eq. (3-56) in (3-57), we have 


U Ss 
a ge TU nYnt+l 
in” 


The spectral density can be calculated from Eqs. (3-12) and (3-58): 


byy(7) oe 


vy (0) 


= » cy! 


l= 


where @ = tan! (w/7). 


Bald) _ Yet [ (oryerm (eter + et) de 
+ J0 
1=0 


1 1 | 
li ee. jJw)tt nt (y + jw)'*! 


p? 


—(I+1)/2 ry 
= (1 + =) cos peat): 


(3-58) 


(3-59) 


In the special case of ce: = 0’, y,(7) reduces to a very simple form, and 


,,(jw) can be easily calculated from dy(r). From Eq. (3-58), 


dyy(7) = (vr)! eb! = e— (1-0) 


uy(0) 


U 


The spectral density Pyy(jw) is 
Blin) = dn(0) [oem em + oh) de 


ik 


1 
= yy (0) la a b)v — jw + ( 


_ (1 — b)rdyu(0) 


= @ — by? + @ 


Qv 


[7, #p@ 48 


yy (jw) a al = b)[ — b)2v? + w| 


1 — b)v + Jw 


| 


In the limiting case of completely independent inputs, 6 = 1, 


while $,,(r) becomes infinity. 
function is 


; ve 
$,,(jo) = aE 


(3-60) 


(3-61) 


(3-62) 


(3-63) 


V correlation 
However, the normalized auto 


buy (r) os 1 
buy (0) 


(3-64) 
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These relations are the same as the corresponding ones in Sec. 3-10 and 
can be explained in the same way. 
In the case of completely independent y,’s, b = 0, 


On(r) = Bem (3-65) 
ead Qve? 
and ®,,(Jw) = pan pf (3-66) 


From the above derivations, we see that, if a certain signal is a random 
series of step inputs, its spectral density is of the form 1/w? only if the 
amplitude of each input is completely independent of all others. Other- 
wise, the spectral density becomes of the form 1/(w? + w,2), where the 
corner frequency w; is (1 — b)y. Correspondingly, the spectral density 
for a random series of inputs of the waveform f;,(t) is 


Aw F;,(jw)F.(—jw) 
w? + wy? 
where A is a proportionality constant. 

3-12. Spectral Densities of Signals Containing Pure Sinusoidal Com- 
ponents. In Fig. 3-2, the spectral density represents the power within 
4 narrow frequency band per unit bandwidth, and the cross-spectral 
density represents a similar product involving two signals. In case both 
signals are sinusoidal and of the same frequency wo, the measured power is 
sero if wo is not included in the passband, and a finite power P if wo is 
included in the passband. For instance, P may be the power dissipated 
in a 1-ohm resistor or the vector power of an a-c source, depending on 
whether the spectral density of an alternating current alone or the cross- 
spectral density of emf and current are being considered. In any case, 
we may write for positive values of w; and we 


1 ris hy a Py if w, < wo < we 
if #,;(jw) dw = i ae Mag Odie ph (3-68) 


Gi,i,(Jw) = (3-67) 


Obviously, ®:;(jw) is zero for all values of w ¥ wo. In order that the inte- 
gral of Eq. (3-68) be finite, ;;(jw) must be infinity ata = ws. A function 
of this nature is called an impulse function or delta function and is denoted 
ly 4(«). Its properties are discussed in Appendix B. Considering that 
Ny (jw) = [%:;(jw)]*, the spectral density ®,;; can be written as 


P;;(jw) = r[PF 5(w + wo) + Pi5(w — wo)] (3-69) 


If a random time function contains a number of purely sinusoidal 
fomponents, 


it) = f() + ¥ Ax sin (wit + ox) (3-70) 
k 
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where #;;(jw) is bounded with perhaps an exception at w = 0, then 
Hi (jo) = Sys(5a) +5 bs Ay"[5(w — wx) + 8(@ + )] (8-71) 
i 


Equation (3-71) is a direct consequence of Eqs. (3-18) and (3-69). Obvi- 
ously, the cross-spectral densities of sinusoidal functions of different fre- 
quencies vanish. Since &;;(jw) is bounded, there cannot be any coherent 
sinusoidal component in f(¢), and the cross-spectral densities between f@® 
and the various sinusoidal terms are vanishingly small compared with the 
8 functions. Equation (3-71) gives ®:(jw) as the sum of the spectral 
densities of all the individual components. 

In practice, a purely sinusoidal signal perhaps does not exist. There 
is always some frequency or phase or amplitude fluctuation, and the only 
question is how much. Such a signal can be approximated by a random 
series of damped sinusoidal functions in which each input amplitude a 
is completely independent of all previous a’s: 


y(t) = ¥ au(t — rie sin wo(t — 72) (3-72) 
L 


From Sec. 3-8, the spectral density of y(¢) is the same as that of each 
individual term except for a constant multiplier. Therefore 


A 3-73 
*i) = GotaGete)Gerage a) orm 


where a = ¢ + jwo, and A is a constant which depends on the amplitude 
distribution of a, the number of 7’s per second, ande. Since 


ee Det A F 
tn(0) = ge [ Baja) do = gs (3-74) 


Bq. (3-73) becomes 
' 4aa* byy(0) 3-75 
Fy(jw) = (jw + a)(jo + a*) (jw — a)(jo — a*) ( ) 


Equation (3-75) is useful in studying the optimum filtering of a sinusoidal 
noise signal from a random input signal. ; id 
As ¢ approaches zero, y(t) approaches a purely sinusoidal wave. 4 
is always possible to adjust the distribution of a; in such a way that ¢,,(0) 
ins constant as e approaches zero. 4s 
ae ere Summary. is ile chapter, we have defined spectral densities 
and correlation functions for random signals, both mathematically and 
physically; derived their transforms and other properties ; and sabes 
ways of calculating these functions for random signals of the genera _ - 
shot-effect type directly from the probabilities which specify these signals, 
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The concept of a distribution function for system error or other system 
variables is introduced. When the system inputs are either Gaussian or 
highly random in other predesignated ways, normal distribution can be 
used to approximate the actual distribution. Under such conditions, 
once the mean-square amplitude of fluctuation of a system variable is 
known, its distribution is completely known. A corrollary of this is that 
a system with the least-square error gives the least cost no matter how 
the cost function is defined as long as it isa non-decreasing function of the 
error. Similarly, a system with the least-square amplitude of the input 
to a saturable fixed component has the least tendency to saturate. The 
concept of stationary processes is introduced, together with the ergodic 
hypothesis which states the equivalence between time and ensemble 
averages. 

Having thus established the plausibility of the least-square criterion, 
we have shown that for stationary systems the only required data for 
optimum design based on this criterion are the spectral densities of the 
input signals. As another facet of spectral densities, the correlation 
functions are defined and are shown to be the inverse Fourier or Laplace 
transforms of the former. The physical and statistical significances of 
these functions are also illustrated. Equations relating the spectral 
densities of the output functions to those of the input signals and transfer 
functions of the system are derived. 

The spectral densities of random signals of the generalized-shot-effect 
type can be calculated by calculating first the spectral densities of the 
corresponding unit step series, and the waveform factor is taken into 
consideration later as a separate step in Eq. (3-36). The correlation func- 
lion of a unit step series is calculated from two given factors: (1) the dis- 
\ribution as well as the interdependence of the successive amplitudes and 
(2) the distribution of the breaking points or times of change in amplitude. 
When the latter is completely random, Poisson’s distribution is derived 
und used as a basis for such calculations. Once the correlation functions 
ure determined, the spectral densities are calculated by Laplace or Fourier 
transform, 

Sometimes purely or nearly sinusoidal components are present in a 
random signal. The former give rise to 6 functions in the spectral den- 
“ity, while the latter introduce a sharp but finite peak which can be 
(leseribed by a rational function of w?. 

In closing, it may be remarked that the use of spectral densities to 
fepresent stationary signals is not only a powerful technique but is a 
tural one for electrical engineers. Because of the transform relation- 
ship between spectral densities and correlation functions, whatever 
wialytical results are obtainable from the use of one representation 
are also obtainable from the use of the other. However, as the relation 


ee 


dt 
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between input and output spectra of a linear network is pene the 

same for random signals as it is for steady-state sinusoidal signa s, ot 

analytical work is on familiar ground when spectral densities are used. 

iliari i ion functions. 

No such familiarity exists for correlation : 

On the other hand, correlation functions are haw capris ea 
i i in Chap. 5. The spectral-densi 
stationary signals, as we shall see in 8p é 
function is easontially a time-averaged quantity. Its significance is lost 


hen time average becomes meaningless. 
hi We may conclude that spectral densities should be used for analyzing 


stationary (or approximately stationary) systems, whereas correlation 
functions should be used for analyzing nonstationary systems. 


CHAPTER 4 


OPTIMUM SYSTEMS WITH RANDOM INPUTS 


The design of systems with random inputs is perhaps the most sig- 
nificant problem that makes least-square optimum-design techniques 
worthwhile. In practically every high-performance system, the ultimate 
error is caused by noise or load disturbance or some other undesirable 
random inputs. There is always the thermal noise or the induced noise 
in a highly sensitive sensing element or in the inertial or some other 
reference system. After every removable source is eliminated, the 
residue is always there and most likely random in nature. Load disturb- 
ance is even more fundamental. We can say with some justification 
that, without consideration of load disturbance, a feedback control 
system has little to offer over a feedforward system. 

In this chapter, we shall formulate the optimum-design problem in gen- 
oral terms, give a general solution, and then proceed to show its various 
upplications. One advantage of this approach is that it shows how 
various design considerations can be incorporated into the same pro- 
cedure. However, our purpose is not to present the derived procedure as 
i cookbook solution of design problems of this type but rather to illus- 
(rate the method. 

The mathematics is basically the same as that used in Chap. 2. In 
fact, for simple problems, very little change needs to be made. For 
instance, if in Example 3-2 F(s) is to be determined so that 

fe®P = min 

GOP < Ke 
the method and solution of Sec. 2-4 can be taken over almost in their 
wntirety, with %,,(jw) taking the place of R(s)R(—s). As we have little 
iwod for preliminaries, we shall proceed to the general case directly. 


GENERAL PROCEDURES AND METHODS 


4-1, Optimization Criterion and Constraints. Figure 4-1 illustrates a 
lypieal system with a single controlled variable. The given elements 
we the controlled system or fixed component G(s) and the main-loop 


feedback function H(s) [Hm(s) is usually a constant]. The desired out- 
63 é 
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put ca(t) results from a specified linear operation L(s) on the reference 
input r(é). L(s) is purely hypothetical and may or may not be physically 
realizable. For instance, the desired output ca(t) may be the reference 
input 7 sec later: 
ca(t) = r(t + 7) 
The Laplace transform of the above equation gives 
Ca(s) = £{r(t +7)} = e*R(s) (4-1) 


The linear operator L(s) is then e*, which obviously does not represent 
any physically realizable network. Other types of linear operation may 


r(t)+n(t) 


| 
| 
Y 
| 
| ' ! 
steeatreeticsyer ont H(s) pet teretiniecspataty ag 


Fia. 4-1. Block diagram of a system with multiple inputs. Solid line, fixed elements; 
broken line, possible arrangement and compensating elements to be designed. 


be physically realizable; for instance, a follow-up system is represented 
by L(s) = 1. Besides the reference input, there are also undesirable or 
spurious inputs into the system, such as noise n(t) and load disturbance 
d(t). The transfer function G2(s) represents the process through which 
the controlled variable c(t) is affected by the load disturbance d(¢). For 
instance, d(t) may be a torque and c(t) the shaft position, and G2(s) is 
then 1/Ms?, where M is the moment of inertia. The system error e(¢) is 
defined as 

e(t) = ca(t) — c(t) (4-2) 
Our design problem is to select the series-compensating function G(s) 
and/or the parallel-compensation function H(s) so that 


[e(t)]? = min (4-3) 


Sometimes there is a multiplicity of inner loops and compensating net= 


works, These apparently complicated systems are, however, reducible 
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to the form shown in Fig. 4-1, but the latter cannot be further reduced 
without altering one or the other of the closed-loop system functions 
C/RandC/D. From Fig. 4-1, C/R and C/D can be readily expressed as 


1 


1 

(7B) ~ En) + HO) + a Hae (4-4) 
C; nS G2(s) 6} 
D (s) = GG) RB (s) (4-5) 


Equations (4-4) and (4-5) illustrate that C/R and C/D can be inde- 
pendently determined. Given these two functions, G.(s) can be deter- 
mined from Eq. (4-5), and H(s) is then determined from Eq. (4-4). 

Because of the presence of a fixed component, there are two types of 
constraints or limiting factors on the system: 

1. Functional Constraints. In order that the closed-loop system be 
stable, all the poles of the system functions C/R and C/D are confined to 
the LHP interior. Additional restrictions exist if the fixed component 
has one or both of the following properties: 

a, TRANSPORTATION LAG OR DEAD TIME. If the fixed component has 
absolutely no response for a period 7 subsequent to the application of a 
forcing signal, the period 7 is referred to as transportation lag or dead 
lime. Mathematically, we may write Gi(s) as having a factor e-**. 

b. NONMINIMUM PHASE. If G,(s) has one or more zeros in the RHP, it is 
referred to as a nonminimum-phase function. The term minimum phase 
was used by Bode to define the class of functions that obey his famous 
(heorem relating gain and phase. For a transfer function to represent a 
slable system and to have minimum phase, all its characteristic fre- 
(uencies must be in the LHP, including the origin. A pole in the RHP 
causes the component to be unstable without feedback, and a zero in the 
K11P causes it to have a greater phase lag for the same gain character- 
istics. The latter point can be illustrated by an example: Let us compare 
{wo transfer functions, one with 1 + s and the other with 1 — s in the 
iumerator but identical otherwise. The gain of the two functions is 
identical at all frequencies. However, the phase lag of the latter is 
greater than that of the former by 2 tan—! w. 

Physically, if the initial response of a component to a step input is 
opposite to its steady-state response, the component must have non- 
iiinimum phase. Obviously this initial reversal necessarily causes delay 
i the closed-loop system response. 

I'rom the above discussion, G(s) is expressed as 


G(s) = e™P(s)Gir(s) (4-6) 


where P(s) = I] ¢ - £) (4-7) 


i J 
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The d,’s are zeros of G(s) in the RHP, and Gi:(s) represents the balance 
of the transfer function. The question is: What are the restrictions 
imposed by e~*P(s) on C/R and C /D? 

With reference to Fig. 4-1, let 7(¢) denote the input to the fixed com- 
ponent G,(s). The closed-loop transfer functions J/R and I/D can be 


expressed as follows: 


2 -~G@R® a 
b= galbo-%0| (4-9) 


It is physically impossible to apply to the fixed component a forcing 
signal i(¢) in anticipation of the system input or load disturbance. There- 
fore the system functions 7/R and I/D cannot possess the factor e”*. 
Furthermore, J/R and I/D cannot possess 1 /P(s), because in that case 
i(é) would rise exponentially as a result of any thermal agitation or a 
sudden input. While the system is stable in the sense that the controlled 
variable c(t) has only exponentially decreasing terms as long as the system 
is linear, it would not operate within the linear range for long, because of 
the exponentially rising i(t). Therefore the factor e/P(s) which is 
implicit in 1/G(s) on the right-hand side of Eqs. (4-8) and (4-9) must 
be canceled by a factor e-*P(s) in (C/R)(s) and (C/D)(s) — G2(s), 
respectively. 

The fact that the fixed component is open-loop unstable, as exhibited 
by poles in the RHP, does not cause any restriction on its closed-loop 
response. The poles are readily removed by inner-loop feedback, which 
can be considered as part of H(s). 

The functional constraints are taken into consideration in the opti- 
mization procedure by assuming appropriate unknown functions which 
will be denoted as F,(s), i = 1, 2, .... The only restriction on the 
F;(s)’s is that all their poles be confined to the LHP interior. 

Let us consider (C/R)(s) first. If G,(s) is known’ to be stable, we can 


write 
Cc 
© (8) = FOG) 
The above expression automatically takes into account the restrictions 
due to transportation lag and nonminimum phase. It is not adequate if 
G(s) has one or more poles in the RHP. On the other hand, if we write 
C 7 
R (s) a I (s) 


poles of G(s) in the RIP do not present a problem; however, additional 
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provisions must be made for the z i i 
yr eros. An expression which i 
all situations is J eras 


C 
R (s) = e-*P(s)F,(s) (4-10) 


In Eq. (4-10), onl r 32 
sae ) y the RHP zeros of G(s) are explicitly expressed. 


C 
p (8) = &*P(s)F2(s) + Ga(s) (4-11) 


’\(s) and F2(s) are unknown functions i 
to be determined. The conditiox 
ee oan es a and F.(s) be in the LHP, together with Bigs. 
-10) and (4-11), is sufficient to ensure that the functi I 
Neco: (nate nctional constraints on 
2. Amplitud J 1 i 
. plitude Constraints. The amplitude constraints may be expressed 


aE <K; i=1,2...,MN (4-12) 


In ee seep the a,(t)’s represent N signals which either exist in the 
controlled system or i 1 i ; 
agg M4 may be obtained by given linear operat’ons on such 
a ie be emphasized at this point that there are two aspects of our 
problem: the analytical aspect and the engineering aspect. From the 
ees tien) point of view, we pose Kgs. (4-3) and (4-12) at the start and 
P ‘ aina system which satisfies these conditions, without questioning why 
rom the engineering point of view, we can never exhaust the various 
ona situations under which these criteria are justifiable. While our 
mbuc y is concerned mainly with the analytical aspect, it may be worth- 
W % to mention a few situations which give rise to Eqs. (4-12). 
A Heat face The copper loss in a motor is proportional to the square 
of the winding cu i i i 
- g current, which can be considered one of the constrained 
» ‘ : 
| a oe The fixed component saturates when the input level is 
00 sii The saturation may take many forms: for instance, magnetic 
. uration in a motor, maximum effective displacement of a rudder or an 
0 av ator, maximum flow of a hydraulic valve, etc. As saturation always 
pies rise to an undesirable effect, the simplest way to take saturation into 
i vee is to avoid it as much as possible by limiting the mean-square 
4 ue of the variable a;(t) as expressed by Eqs. (4-12). Let &; represent 
_ ae limit. For Gaussian random inputs, it can be readily 
shown from See. 3-2 that the pr ili 1 ' 
ees, 1c probability or fraction of time for a,(t) to 


; ee §& 
P(la| > &) = 1 — * (Ser) (4-13) 
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If \/K; is selected as a small fraction of &, 1 — (&/ 4/2K;) is very close 
to zero. Therefore the inequality of (4-12) controls the probabilities of 
saturation at various points in the system. 

3. System bandwidth. The cost of components increases with the 
system bandwidth. This is partly because of the saturation effect which 
is accentuated by any attempt to speed up the system response and partly 
because of the resonant frequencies and phase lags of the various com- 
ponents in the higher frequency range. The transfer functions that we 
use in ordinary design work are approximations at best, in the sense that 
small time constants are neglected and distributed lags are treated as 
simple time constants. Notable examples of the latter are the transmis- 
sion lag in hydraulic conduits and torsional shafts. It is costly to elevate 
these effects to a higher frequency range; furthermore, there is a limit on 
what we can do, considering cost or not. 

Newton introduced an artificial measure of the bandwidth and posed 
as a condition for system design that the bandwidth so measured be below 
a specified value. One would accomplish the same result by requiring 
that the mean-square value of the system output in response to a hypo- 
thetical random input w(t) be limited or at a minimum. The spectral 
density of the hypothetical input ®.w(jw) is a rising function of w. By 
assuming different functions for ®,.(jw), we can reflect the practical 
undesirability, in cost or otherwise, of a high-frequency system. 

4. Other criteria. There are many other ways of establishing con- 
straints. For instance, we may not wish to limit the complete system as 
much as to limit to lower frequencies the input 7(¢) of a certain component. 
A high-pass filtering function operating on 7(¢) gives the constrained signal 
a;(t). 

4-2. Solution of the General Problem. Having thus far put our 
optimization criterion and the various constraints into mathematical 
expressions, we shall proceed to obtain a solution of the problem. The 
first step is to express the system error in terms of the inputs and the 
unknown functions F;(s) and F2(s): , 


E() = Cals) — Cs) = LR) — F (RS) — 5 ODE) 
= L(s)R(s) — e-*P(s)Fi(s) Ri(s) — [e-*P(s)F2(s) + G2(s)]D(s) 
(4-14) 
To make the problem more definite, let the constrained signal a(t) 


specified by the following equation: 


A(s) = Gs(s)1(s) = a [C(s) — Ga(s) D(s)] 


where G(s) may be unity or some other network function, as discusse 


(4-15 
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in the preceding section. Making the same substitutions as was done in 
liq. (4-14), we obtain 
e—P(s)G3(s) 


A(s) = G(s) [Fi(s)Ri(s) + F2(s) D(s)] (4-16) 


Equation (3-11) gives the mean-square values e? and a@ as the following 
integrals: 


a 1 fie 
apes Oa i ®.e(s) ds (4-17) 
Se TY hes 
C= ge Paa(s) ds (4-18) 


rom Eqs. (3-14), (4-14), and (4-16) the spectral densities ,.(s) and 
Paa(8) can be expressed in terms of the spectral densities of the inputs, 


rd), ) (). It is assumed that no correlation exis 
f t) 
and d(t): ®,,a(s) pes $,a(s) 0. LSts between 1 1(é) 


®..(s) = LL@,, + Dee end Lé,, ae L&,, 


LL®, ae (4-19) 

$.-(8) = PPF\F1®,,,, + (€7*PF, + G2) (e*PFs + G2)®aa (4-20) 

Bre(8) = PF yy, (4-21) 

&.,(3) = ePP,S,,, (4-22) 
_ GGPP |, ‘ 

Paa(s) = GG (Fi 1®,,., “4. Ff oF aq) (4-23) 


Wquations (4-17) to (4-23) give ¢? and a? in terms of the unknown fune- 
lions #; and Fy. Our design criterion is that @ should be a minimum 
while a? is limited by inequality (4-12). We can follow the same silane 
dure as was used in Sec. 2-4 to derive optimum forms of F,andF>. Alter- 
tively, we may use the optimization theorem which will be derived in 
(he next section. It gives the conditions to be satisfied as 


IPDee O® 
—+k?—“=X 
i ania (4-24) 
IB oe * } 
a Oe = Xi 
oF’, OF's 
where k? is Lagrange’s undetermined multiplier and X i(s) and X.(s) are 


specified except that they do not possess any poles in the LHP. The 
reason for using k? instead of the conventional 2 is explained in Sec. 2-4. 
lhis condition, when taken together with the restriction on the pole loca- 
lions of Fy and Fs, gives a unique solution of the problem, as will be shown 
lwlow, In taking the derivatives, /’,, F'., F\, and Fs are considered inde- 
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pendent. Equations (4-24) give 


YYPPF,@,,,, — e*PL®,,, = X,(s) (4-25) 
Y YPPF 8.4 + eT PG 2Paa = X2(s) (4-26) 


where Y is a function with all its poles and zeros in the LHP and is defined 
by 


ak (4-27) 


Y=}1 = 
[hha 


The two unknown functions F; and F, can be solved independently. 
Let 


{®rr}* 
{Baa} * 


All the poles and zeros of Z and D are in the LHP. The function D is 
boldfaced so that it will not be confused with the Laplace transform of 
d(t). Following the steps which led to Eq. (2-29), we arrive at the follow- 
ing solutions: 


(4-27a) 


Mtl 


Z 
D 


eee ag (4-28) 
PYZ |) Pyar 

#4 I e®PG2D (4-29) 

ie* 5 | PY I. 

Let Y.(s) be defined as 
_ P(=9¥@) a 
Yi(s) =S Fay ( ) 
From Eqs. (4-10), (4-11), (4-28), and (4-29) we obtain 
@ = eos CL Pr, (4-31) 
c@ = Al a ib 
—Ts erg D }- 

50 =A) -¥5 [SP] (4-32) 


From Eq. (4-27) we see that 1/Y can be obtained by a root-square-locus 
plot from Gi/G;. The zeros of 1/Y, however, are in the LHP by defini- 
tion. They may or may not be the original zeros of Gi/G3. The defini- 
tion of Y,(s) implies that all the zeros of 1/Y(s) which are not zeros of Gy 
are replaced by the latter. Since none of the poles of Gs; can be in the 
RHP, all the zeros of 1/Y(s) are the same as zeros of @,/G, while all the 
poles of 1/Y,(s) are obtained by the root-square-locus plot. Thi 
description allows Yi(s) to be determined directly, without resorting 
Iq. (4-30). 
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An alternative way of solving the above problem is to consider d’(t) 
of Fig. 4-1 as the load disturbance. Let the primed functions represent 
the ones to be used instead of the original functions. Then 


Pya'(s) = Ge(s)G2(—s) Baas) 
ia ed (4-33) 


If G(s) has minimum phase, the two methods are equivalent. If 
(/,(s) has zeros in the RHP or transportation lag, the two methods repre- 
sent different physical situations: The former method applies when d(é) 
is directly measurable, and the system block diagram of Fig. 4-1 is cor- 
respondingly modified. The latter applies when d(é) is not measurable, 
and its effect on c(t) is the only quantity measurable through the error- 
sensing device, as illustrated by Fig. 4-1. 

4-3. An Extension of Variational Calculus. In most applications with 
least-square criteria, the integrand of the integrals which are being mini- 
mized, or limited in value, is of quadratic form in the unknown functions 
I’\(s) and F,(—s). However, it may be desirable to state the variational 
problem in a relatively general form. Let the integrals J, be defined as 


1 fie ores = 
i. = a ®,,(s;F'1,Fs, nish FPajF1,F 2, he. be Hage) ds 
m=0,1,2,...,M (4-84) 
The functions $, and F;,m = 0,1,2,...,M,i=1,2,... ,n, satisfy 


the following conditions: 
|. F;(s)’s are physically realizable network functions, in the sense that 
(hey are real functions of s with all the poles in the LHP, thus excluding 
(he imaginary axis. 
2. &,’s satisfy the conjugate condition that, for all subscripts 7 and m 
and arbitrary @, 
0®,, 


a, _ 9®n 
aF; 


8=Jw1 oF i 


(4-35) 


8=—jar 
In taking the partial derivatives, 7; and F; are considered independent. 


| Condition 1 is obvious. Condition 2 is equivalent to the following: Given that 
the functions F;(s),7 = 1,2, . . . , n, are real functions of s, ®n is real on the jw axis 
Wid is even in w, Pn(jo) = &,(—jw). This condition obviously holds for physical 
*yslems and is sufficient for condition 2. The proof will not be given here as it is 
foitine and lengthy. Condition 3 can be interpreted as follows: Let us regard s, e78, 


h(), and F;(s) as a set of variables. If ®,, can be expressed as a polynomial in these 
variables, condition 3 is easily shown to be true. We shall see that in all the applica- 
Hion problems of this chapter ©, can be expressed as a polynomial in these variables. 


However, as condition 3 also holds for some other form of ®,,, it is the weaker condition 


if the two. It is always desirable to use a weaker assumption, as the result of the 
theorem is then more general, 
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3. For admissible F,(s)’s, that is, the F,(s)’s which give convergent 
values of all the integrals Im, 
lim sf, 22" = 0 (4-36) 
oo 1 a 
In case 06,,/0F; has essential singularities at infinity, such as e”, Eq. 
(4-36) is required to hold only for s with a negative real component. An 


optimization theorem may be stated as follows: 
If conditions 1, 2, and 3 hold, a necessary and sufficient condition for 


M 
I= Y Anln 
m=0 
to be stationary with respect to infinitesimal variations of Fi(s), F2(s), 


.. . , Fn(s) and corresponding variations of F1(—8), F.(—s), ... , Fa(—8) 
is that the partial derivatives 


M 
ie py 8 - (4-37) 
m=0 


do not have any pole in the LHP including the imaginary axis, where the 
Am’s are arbitrary constants. 


From Lagrange’s method of undetermined multipliers, we see that 


M 
z AnPn being stationary is a necessary condition for the following: 
m=0 

To = min 

1 Ee Ce, mew td. (4-38 


The proof of the theorem is as follows: 
Let ¢f;(s)’s be an arbitrary set of infinitesimal difference functio 


where ¢ is an infinitesimal constant and the H;(s)’s are finite. Since t 
admissible F;(s)’s satisfy conditions 1 and 3, the difference functio 
eH;(s) between two sets of admissible F;(s)’s must meet the same con 
tions. The first-order variation 6J is 


M n 
Xr i2 Ob 0®,, = 
m OPm ry AR ee 4e 
: Dane aes 2S A.) as 0 (4-39) 
m=0 4=1 


Since the H,(s)’s are independent of each other, Eq. (4-39) is equivalent 


M 
r,, [2% [88m ID», ) in 
x ™ HH, + —" A,)ds = 0 i=1,2,...,7 (4 
> 33 | (or i oF, 
0 


je 
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By using Eq. (4-35) and substituting —s for s in the following integral, it 
becomes 
J? AD, IPY OOo 
en, Heds = — A; d i 
i OF; a or; Z (A 4) 
Substitution of Eq. (4-41) in (4-40) gives 
_ M 
je ODn = 
7 : oe OF; H; ds = 1) (4-42) 
m=0 


As a result of condition 3, the path of integration of Eq. (4-42) may be 
extended from +j* through a large semicircle to —j enclosing the 
entire LHP, and there results 


f warXi(s)H(—s) a= 0 ites iL, 2, Peso ata (4-43) 


Since H;(s) does not have any poles in the RHP including the imaginary 
axis, it follows that H;(—s) does not have any poles in the LHP including 
the imaginary axis. A sufficient condition for 67 = 0 is that the func- 
tions x i(s) do not have any poles in the LHP including the imaginary axis 
his is also the necessary condition, because of the arbitrariness of 
I, ( —8). If any of the functions X;(s) has one or more poles in the LHP 
a function H;(—s) can be found such that the sum of residues does si 
vanish, and the condition embodied in Eq. (4-43) will be violated. 

Nquation (4-37) with the above specification on X;(s) can be easily 
remembered as an extension of variational calculus. If we do not require 
\he poles of F;(s) to be in the LHP, the result of variational calculus is 
\hiat the expressions for i = 1, 2, . . . , n on the right-hand side of Eq. 
(1-37) vanish. With the additional restriction on the locations of the 
poles of F',(s), the result is correspondingly relaxed and the same expres- 
sion is only required to be analytic in the LHP. This kind of relaxation 
is essential to finding a solution satisfying simultaneously the restriction 
on the location of the poles of F;(s) and Eq. (4-37). 

: Some of our friends may ask, with Eq. (4-37) and the restriction on 
Hi(s), can we find a unique solution to the set of unknown functions 
(8)? As mathematicians, we may simply turn up our noses to imply 
\hat to answer such questions is beneath our dignity. A mathematical 
\heorem need only establish the equivalence of two sets of conditions or 
oulablish that one set of conditions implies another set; whether or not 
wy set leads to any useful result is not the point. However, as engineers 
we do have to answer such questions. _ If ,, is bilinear in F;(s) and Fs), 
hq, (4-37) leads to a set of simultaneous linear equations in F,(s). Thee 
euutions, together with the restriction on the locations of the poles of 
/(«), do lead to unique solutions of F,(s). 


—=<— 
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APPLICATION PROBLEMS 


In the following sections, we shall utilize the previous result to study 
the effects on system performance due to various limiting conditions and to 
obtain the optimum forms of filtering and compensation under various 
special circumstances. In order to demonstrate the physical signifi- 
cances, only one or two limiting conditions are taken up each time. 
Admittedly, a larger number of limiting conditions may be present in a 
system; the difference in arriving at a solution is, however, in the amount 
of effort required rather than in kind. 

4-4, Optimum System in the Presence of Load Disturbance. As an 
example, we shall study the optimum structure of a positioning system in 
the presence of load disturbance. The block diagram of the system is 
shown in Fig. 4-1, in which G;(s) = Hn = 1, G,(s) and G,(s) are known, 
and G, and H are to be determined. In case the controlled system is of 
the minimum-phase type, we can usually save computation by defining 
F, and F, as the system functions C/R and C/D. The optimization 
procedure of Sec. 4-2 gives 


1 [® 
Pie= <li 4-44 
k? G2D 
Fo = ss —_ 4-44a 
ee 2 leer, 


The functions Y, Z, and D are given in Eqs. (4-27) and (4-27), with 
G; = 1. The spectral density of the reference input signal, the transfer 
functions Gi(s) and G.(s), and the fixed components are given as 


= M A?(w? + w”) ‘ 

Grn = Be = StH we) (4-45) 
K 

Gi(s) = KrG@s(8) = are (4-45a) 


Equation (4-45) implies that the noise is negligible, and (4-45a) implies & 
disturbing torque or force. 
Two different types of load disturbance are studied: 
1. White Gaussian disturbance. 


Daa = B? 
2. A random series of step inputs. 
B2 
Pua = GF fot 


The load disturbance is assumed to be uncorrelated to the referen 
input signal. The closed-loop transfer function F, is, according to Bq. 
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(4-44), 
Pes e 
Fi = $9 Bl (4-46) 
h = Se 
where Y = (1 + as)(1 + bs) = oe (s? + 2fe.8 + w,?) (4-47) 
i A (s +w1) 
ee a) (4-48) 


The constants a and b are the same as those given in Eqs. (2-65) and 
(2-66), ¢ = cos 6; and w, = 1/r. Substituting Eqs. (4-47) and (4-48) 
in Kq. (4-46), routine calculation gives 


F, — (s + 4) 04,2 / 4 
(s + w3)(s? + 2eoxs + wn?) (4-49) 

where ieee we? + Waowe + wo? 
i aa 02 + Wow, + wiwo (4-50) 


Next we shall determine F2(s) for th i 
Bees 2(s) e two different types of load 


Case 1 
Pia = B? D(s) = D(s) = B 
Iquation (4-44a) gives 


F,(s) = sige l —ns + Ms? | 
+ 


Kr? Y(s) | 1 — as)(1 — bs) 
ae NP PS Ci C 
Kr? Y(s) E 1 — as i 1 oI, (4-51) 


where C; and C2 are constants. We note that the constant term repre- 

rents a pole at infinity.| Since Y(s) and Y(s) are of the order of s? at 

infinity, it is possible for F.(s)¥(s) to have a pole at infinity but not for 

RS Consequently the term M/ab is considered an LHP term, and 
« have 


2 
F2(s) = tM M 2 = ket /Kr (4-52) 


& . a is finite and nonvanishing, a pole at a can be written as either C/(s — a) or 
4 - 8/a). As a approaches infinity, the former form cannot be used since it 
riage aire! zero, unless we make the assumption that C is proportional to a 
which reduces it to the latter form. The latter f 
een oe r form reduces to a constant C’ as a 
Bas sheng Bend of obtaining the same result is to determine the limiting form of the 
votion ’, a8 8 approaches infinity. Since the loop gain 
} of t 
Approach 0 as w becomes infinity, By See 
k kB 


Pym Gy and YDF.— = BQ; = —- 
q, Ky 
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From Eqs. (4-4) and (4-5), the compensating network G, and feedback Case 2 
function H are obtained: 2 
Daag = pe kon D= awe 
= lo s+ 0 (4-53) B . y wo? + w? Er Been 
Or Le bam iquation (4-44a) gives 
iia ei knoys 8 + ws 2rwis s + w1 — coe(cor/or — 1)/2¢ (4-54) F.(s) = Kh stor —ns + Ms? 
Kroi 8 + o; W 1} 8+ 0; Kr? Y(s) | (1 — as)(1 — bs)(s + oe) |y 
Equations (4-53) and (4-54) are illustrated in Fig. 4-2. If there is addi- os es Mw;? + nwr 1 
tional damping due to armature speed emf, 7 in the regenerative feedback Kr? (1 + awe)(1 + bwe) ¥(s) 
branch is replaced by » + K.Kr/Ra, as shown by the dashed line in Fig. — Kk aw) _ _ kwy?a(we)/Kr 
4-2. Kr Y(s) s? + 2rars + wy? (4-55) 
Tf wo is negligibly small, the term #1 — wy (w1/«, — 1)/2¢ vanishes, and where a(w2) = 22? + 02/M 
the lead network in the degenerative feedback branch is replaced by a 2” + Wwow, + wr? Png (4-56) 


l'rom Eqs. (4-4) and (4-5), G, and H are obtained as 
u @18§ sf w4 


differentiating circuit with time constant wi-', The system becomes a 
type 2 system. Physically, ®,, of Eq. (4-45) may represent a composite 


series of random step and ramp inputs, and wp is due to the correlation Gi, = Poses are apt (4-57) 
between successive ramp inputs. If the slope of the reference input r(t) Tey : ; 
does not change frequently, w: would be relatively small, and we obtain a H = wo . :: =i (=ns i l-a Ms*) 

TW) W, a 


type 2 system as the optimum result. 


+ 2fwis[s + wr — wx(w1/H4, — 1)/2¢] 


d(t) w1W%(S + 4) (4-58) 
: [quations (4-57) and (4-58) are illustrated in Fig. 4-3. Physically, 
Amplifier and | Motor and {load system (he spectral density aq of case 2 corresponds to persistent load disturb- 


lead network 


Aneeeyne 


Amplifier and High-gain 
network amplidyne Motor and | load system 
+ 


Fig. 4-2. Optimum compensation of a system in the presence of white Gaussian load 


disturbance. network 


Tachometer 


_ Qa’ 1 + s/o’ 


H - lead network 
u(s) woe 1 + 8/0} seach ie 
ri wr(wi/, — 1) 
ye et FS moral 
2r Vio, 4-3, Optimum compensati f tem in th 
ee. a, 48. ‘ompensation of a system in the presence of persistent load dis- 
Gals) /w\ turbance. m = + K,Kr/Ra. H,(s) and G,(s) are the same functions of s as in 


" hKe 1 + 8/o1 Vig. 4-2. 
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ance. The smaller the value of w:, the longer the load disturbance is 
applied before it shifts in value. From Eq. (4-56), we see that a(w2) 
is approximately proportional to as, and the amplifier gain varies as 
1/ws. With increased amplifier gain, the output variable is less and 
less affected by the presence of load disturbance. Equation (4-58) shows 
that the higher amplifier gain is offset by additional degenerative feed- 
back, so that the closed-loop transfer function F; = C/R is not altered. 

The above two cases are an interesting illustration of the ‘engineering 
common sense” of the optimization process. If the load disturbance is of 
the impulse type given in case 1, it would pay to save the motor power 
and let the load system inertia play some part in counteracting the dis- 
turbance. If the load disturbance is a series of persistent step inputs, it 
must be balanced to a greater extent by motor power, and the amplifier 
gain is raised to make this possible. 

4-5. Noise and Optimum Filtering. In the literature there are various 
ways of reducing the undesirable effects of noise: nonlinear devices which 
operate on the amplitude and linear devices which operate on the fre- 
quency. We may say that, if phase coherence exists between the various 
Fourier components of the desired signal, a nonlinear device is some- 
times effective. Without such phase coherence, linear filtering appears 
to be the only method. At present, we shall take up only optimum linear 
filtering and omit the nonlinear devices completely. 

There is a significant difference between conventional filter theory and 
the optimum filtering that we are about to discuss. In conventional 
filters we usually disregard the phase relationship between the input and 
output signals; however, in servo work the phase error is at least as 
important as the amplitude error. The system error is defined as r(t) = 
c(t) in the time domain. When transformed into the frequency domain, 
the Fourier component of the error is the vector difference between the 
Fourier components of the input and output signals. 

We shall illustrate our method for determining the optimum filter in 
the presence of noise of the following two types: 


1. Sinusoidal noise 
2. White Gaussian noise 


Case 1. Sinusoidal Noise 
: 4aa*(a + a*)Pn 
Pan (jw) = (w? + a®)(w? + a*?) (4-59) 


where P,, is the noise power, a* is the conjugate of a, 
a=e+ jwo 


and «is a very small constant. If the noise is a perfect sinusoidal wave 
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¢ approaches 0 as its limit. From Eq. (4-27a) we have 


4aa*(a + a*)P, 
(w? + a?)(w? + a*?) 
4 : 4aa*(a + a*) P 
= ,, 1 = 
Gltoraerry Ea, 2 
It is permissible to replace the function ®,,(jw) in the brackets by its 


value at wo, since ®nn(jw) vanishes except at the vicinity of wo. Equation 
(4-60) becomes 


ZA _ of + w(a? + a*) + ata + 4aa*(a + a*)Pa/ Sr 
(jo) (w? + a2)(w? + a*) 

N ext, we shall proceed to determine the roots of the numerator poly- 
nomial. Since it is a real and even polynomial of w and s, the roots are 


rear located in foursomes. We write these roots as +b and 
EO”. ms 


ZZ = ®,,(jw) + 


(320) (4.61) 


(w? + b2)(w? + b*2) = wf + w*(a? + a*2) + aa*? + 4aa*(a + a*)P, 


®,-(Jwo) 
q ' (4-62) 
omparing coefficients of the w? term and the constant term 

b? + b*? = a? + a*? = 2(e? — wo?) (4-63) 

o, 4aa*(a + a*)P 

b2b*2 = q2q*2 eA. EN te Me 
a*? + &tias (4-64) 

Nubstituting € + jwo for a, Eq. (4-64) becomes 

b°b*2 = (2 + wo?) (e eer es =") (4-65) 


Taking the square root on both sides of Eq. (4-65) and expressing the 
right-hand side in a power series, we obtain 


see 'g 4P, 8P,2 
bb ="wo" -+ &,, € oe (1 of ae) e? + Rin « (4-66) 


if we define ¢’ and ow, by b = e’ + jut, Eqs. (4-63) and (4-66) can be 
eombined as 


9 WAG *2 LZ « — 2Pn 4P,” 
hm M408 + bm) + Ygbd* = Pret (1 + eee + +s (467) 
Noglecting e? and smaller terms, we have 
2 = ‘i 2Prre 
®++(Jwo) bene, 
wh = Lebb* — 14(b? + b*) = wo? + aes She 


24... (4-69) 


—e+ — GE 
Py W?P yy” 
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Neglecting ¢ and smaller terms, Eq. (4-69) gives 


constant ¢ is determined by the Q value of the available resonant circuit: 


Pee, (4-70) € = wo/2Q, and the constant ¢’ is determined by Eq. (4-68).+ 


We note that the accuracy of Eqs. (4-68) and (4-70) improves as ¢ Case 2. White Gaussian Noise 

becomes smaller. As € approaches zero, both equations become exact. Ban(jo) = Ag? 

From Eq. (4-61) we have "I 
When the noise spectrum distributes over a wide ran i 

2 5: ge of frequencies 

ZZ _ (8% +. b%)(s? + b*2) we would expect the optimum filter to depend on ®,,(jw) for all frequenties 


RR (s* — a*)(s? — a**) 


(4-71) 


Consequently Increasing A*/A2, 


Z_ (st+bj(s+b*) _ 82 + 2e’s + wo? 
R (s+taj(s+a*) 8? + 2es + wo? 


(4-72) 


Equation (4-44) becomes 


Since (R/Z) is very close to unity except at the vicinities of s = +jwo, 
and jws is not likely to be one of the poles, the factor (R/Z) has negligible 
effect on the LHP terms in the partial-fraction expansion. Equation 
(4-73) can be written as 


Real axis 


Pp = 


3? + Qes + wo? 1 [¢] 4 (4-74 
+ 


Nee eh |: y Fra. 4-5. Root-square-locus plot of ®,,/®an. The roots are zeros of ZZ. 
rather than on ®,,-(jwo) of one single frequency, as shown in case 1. Let 


us “ae the spectral density ®,,(jw) of Eq. (4-45). From Eq. (4-27a) 
we have 


We see that Eq. (4-74) can be realized by inserting a filter at the inpu 
end of the system, as shown in Fig. 4-4. For purely sinusoidal noise, t 


_ A%(w? + 02) 
Bo = at oth + Ae (4-75) 


r(t)+n(t) "" 
I'he zeros of ZZ can be obtained, in general, by making the root-square- 
locus plot of A?(@? + w1?)/An2w?(w? + wo”), as shown in Fig. 4-5. Ina 


peel | In INq. (4-74), the engineering “common sense’’ of the optimization process is 
ve His)= ee Agnin illustrated, In most applications, w is outside the servo bandwidth; neverthe- 
a lowy Inq, (4-74) prescribes a notch filter rather than a low-pass filter. The reason is 

een {hut a notch filter has much less phase-shift at lower frequencies compared with that 


ofa low-pass filter, and therefore its phase error is much less. Sometimes the sinus- 
oidal noise is generated inside a control element in the servo loop, e.g., the phase- 
sensitive detector in a carrier system. The same procedure leads to a notch filter 
for the second-harmonic carrier current with the only difference that the constant ¢ is 
Hlotormined by the bandwidth of r(¢) instead of by the available Q because the second- 
Harmonie carrier current has a bandwidth twice that of r(¢). A notch filter in the 
servo loop does not reduce the phase margin nearly as much as does a low-pass filter. 


(b) 


Fra, 4-4, Optimum filtering of sinusoidal noise. (a) Filtering ahead of summ 
point; (b) filtering after summing point, 
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simple case such as the present one the zeros can also be easily calculated: 


ai _ An®(w? + ws”) (w? + wa?) 
ZZ = wo? (w? AS wo?) 


1 A? 1 Ain? 4e2A? 
w3?, 42 = — 3 (wt ar = as 5 (« =F a3) a A2 


From Eq. (4-76), 


(4-76) 


Anls + ws)(s + wa) 
s(s + wo) 


Let %,, be factored into RR. Since ,,, = ®;,, Eq. (4-44) becomes 


Hs Ju RR ae 1 Aw,R(0) if = A(1 — wo) R( — wo) 1 | 
os aie. YZ Raa 8 wo ¥(—wo)Z(—ao) 8 + 0 
_ A fort _ 1-0 _ R(wo) 1 (4-78) 
YZ wo § Wo Y (wo) Z (wo) S&S + wo 
7, = A als +a) 
1 YZ wi!s(s + wo) 


Z= 


(4-77) 


(4-79) 


0100 


“a will — R(wo)/Y (wo) Z(wo)] + wok (wo)/ Y (wo) Z (wo) 
Equations (4-77) and (4-79) give 


where (4-80) 


He A 41 s+ ow 1 
Ano! (s + ws)(s + wa) Y(s) 
W3W4 s§ + oo u 


Ps 4-81 
~ ai! (s + ws)(s + wa) ¥(s) - 


Without noise, the optimum system function F; is 


oi 8 + ie al 
ow, 8 + wi Y(s) 
W1Wo 


will — 1/Y¥(wo)] + o/ ¥ (wo) 


Equation (4-81) can be interpreted as inserting a lag network 1 /(s + wa) 
for filtering and as shifting the critical frequencies w and «: to wi! and 
ws, respectively, so that the low-frequency response is least disturbed. 
Obviously, Fi(s) of Eqs. (4-74) and (4-81) can be paired with F.2(s) of 
Sec. 4-4 to synthesize an optimum system in the presence of both noise 
and load disturbance. The technique is entirely the same as that given 
in Sec. 4-4. 
4-6. Systems with Multiple Constraints. Systems with multip 
constraints can be optimized by a slight extension of the general method, 


Fy = 


(4-82) 


where of = 


(4-83) 
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As will be shown in the following paragraphs, the main effect of additional 
constraints is to alter the function Y(s). 

Let us consider the system of Fig. 4-6. There are two essential con- 
straints in the system: the control field current 7; of the amplidyne and 
the heat loss in both the servomotor and the amplidyne. As the variable 
portions of the heat loss in both components are proportional to the arma- 
(ure current 7,2, we shall impose the more severe limitation of the two on 


i. In case the servomotor is driven by a shunt generator instead of 


d(t) 


ee SH 


r(t)+n(t) 
‘ + 


Kira, 4-6. Block diagram of a system with two constraints (indicated by arrows). 


{he amplidyne, the instantaneous generator field emf e; should be used 
jnntead of zy. 


‘The Laplace transforms of 7, and 7; are expressed in terms of the input 


in" 


I,(8) = aS {[R(s) + N(s)]Fi(s) + [F2(s) — Ga(s)] D(s)} (4-84) 


= he 


I,(8) = F(s)G2(s) {[R(s) + N(s)]Fi(s) + [F2(s) — Ga(s)]D(s)} (4-85) 


whore G5(s) is the transfer function of the servomotor including the effect 
o/ counter emf. The integral to be minimized is 


Jad + kit + Ki = - he vi(s) de (4-86) 
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where 


vi(s) = (Fi - 1)(F; — 1)4,, + PF Pn ote PF Baa + PF @nq + FoF Ban 
+ (Fy 4 L)F Pen ai (Fy io 1)F,@,, — (Fy “os 1)F 28,4 


Me key? ke? - 
Fy, — 1)F 84, = — FyF \(®p, + Pan 
+ Pi — Pats CMe eo F ibn, + 
+ Bin + Par) + M2 — Ga)(P2 — Ga)Paa _ 
+ Fy(F2 — Ga)(®ra + na) + Pi(F2 — Ga) (Par + Ban)] (4-87) 
Differentiating Eq. (4-87) with respect to F; and Fs, respectively, gives 


ky? kee? 
es i ee ES 
" ( + GG, - G.G.GG, 


) [(Brr “t Pan ap Pen = Pn) Fy 


ky? ke? 
+ (Pra + Pra) Fo] = (®,» + am) a Ga (ae + ) (®,a + Pra) 


GG,  GiGiGyG, 


ky? “ kp” 


X = ii i se = 
a Gare 6.6.06; 


) [Beals + (Ben + San)F il 


oF a qe ) Gea (4-89 
"NGG. GiGiGiG 
We shall assume as before that r(t), n(é), and d(é) are not correlated 
and consequently all the cross-spectral densities vanish. Equatio 
(4-88) and (4-89) reduce to the same form as Eqs. (4-25) and (4-26) exce 
that the function Y(s) Y(s) is replaced by 
, k,2 ko? 
Y(s)Y(s) = 1+ =— =——"= 
Go, GyGiG3Gg 
In a positioning system employing an amplidyne-driven shunt motor, 


a. ie —- Kr 

~ 1+T 3 ~ 9s + Ms? 
and Gi = Kr/(ms + Ms), where m = 1+ KrkK./(R, + Ra). Eq 
tion (4-90) becomes 


‘, ky? ke? 
¥(s)¥(s) =1— Kae" — M%s*) — K2 Pn? — M?s?)(1 — T,2s?) 


(4-91, 


The right-hand side of Eq. (4-91) can be factored by solving a third-o 
algebraic equation in s*. While this is not difficult to do, a more gene 
method appears desirable. Equation (4-90) gives 


. key? kG, 
Y(: = G, ky 7 
(s)¥(s) =1+ Gi, (1 1 ee 


(4-90) 


Gi G2 
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Equation (4-92) can be solved, in general, by two root-square-locus plots 
in succession. We shall plot first the root-square locus with (k,2/ks?) 
G\G1G4G4/G.G. as the “open-loop” transfer function. The set of closed- 
loop poles so obtained are joined by the poles and zeros of GG, and 
another set of root-square loci is plotted. The procedure is similar to the 
root-locus plot for multiloop systems. 


w? plane 


I'iu, 4-7. Successive root-square-loci plot for determining the roots Q:, Qz, Qs of YY. 


igure 4-7 illustrates such a plot for Eq. (4-91). Letting 2 denote 
8", we have 
ky GiGiGiG, _ —_k?K%(n? + MQ) (4-93) 
ks? GoGe ko2(1 + T20)(m? + MQ) 
There are two “open-loop” poles at —1/T,? and —7n,2/M? and one zero 
al —»*/M*. The first set of root-square loci is shown in Fig. 4-7a as 
(louble lines on the negative real axis. For a given ratio of ky?2/k2, a 
pair of poles P; and P2 are obtained. If the saturation value J;, of 7; is 
Much larger than (Ry + Ra)Zas/Ka, where J,, is the limiting value of 7, 
the “gain” ratio k,°/k,? would be relatively large. P, is very close to 
~n'/M*, and P, is approximately —k,°K,?/k?T 72. 


| 
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For the second set of root-loci plots, we notice that GsG2 has a pole at 
the origin and a pole at —n?/M?*. The latter cancels the zero at —n?/M?, 
and we have altogether three poles at O, Pi, and P: to start with. In 
Fig. 4-7b the root-square loci are shown as the heavy lines. For large 
gain or small ky, the three branches approach three asymptotes 120° apart. 

Corresponding to a given value of Kr?/k,’, there are three “closed- 


loop” poles Q:, Q2, and Q3. Let qi, q2, and qs denote /-—@,, /— Qa 


and +/—Q3 with their signs so selected that all three are located in the 
LHP. The function Y(s) is obtained as 


ro=(-0-gO-g) wm 


Once Y(s) is determined, the rest of the procedure in synthesizing the 
system is the same as that of Sec. 4-4. 

4-7, Examples of Unstable Systems and Systems with Nonminimum 
Phase. We may define a nonminimum-phase dynamical system as one 
which has at least one zero in the RHP. In contrast, an unstable system 
is one which has at least one pole in the RHP. The latter appears to 
be more difficult to control while operating alone; however, it is far less 
a problem in a closed-loop system. A pole in the RHP can easily be 
removed by closing the feedback loop, but a zero in the RHP remains 
intact no matter what we do with the servo system, using series or parallel 
compensation or any combination thereof. 

As examples in practice, a two-phase servomotor with a rotor resist= 
ance which is not sufficiently high would possess a pole in the RHP, 
A tail-controlled steering system may have a pole in the RHP, a zero in 
the RHP, or both. We shall discuss the servomotor first. The torque= 
versus-speed curve of a servomotor depends very much on the internal 
impedance of the control phase amplifier. The standard procedure of & 
manufacturer’s test is to connect both phases into sources of zero internal 
impedance. If the rotor resistance (in stator terms) is larger than the 
leakage impedance, the motor torque would reduce as it speeds up, am 
shown by the set of solid lines of Fig. 4-8. However, in actual operation, 
the internal impedance of the control amplifier is by no means negligible, 
especially when a capacitor is connected across its output end. If the 
rotor resistance is approximately equal to or lower than the magnetizing, 
impedance, the motor torque may increase with speed for small speeds it 
actual operation, as shown by the broken lines of Fig. 4-8. Let a denot@ 
the slope of the torque-versus-speed curve at zero speed (Y axis). Vo 
small signals, 
dé 
dt 


2 
¢ ; = Ta T,+a ae = Kre +a 
dt : 


yf 1 


(4-9 
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where e; is the input voltage to the control amplifier and @ is the shaft 
position. Equation (4-95) can be written as the following transfer 
function: 

O(s) 2. Kr 

E(s) Js? — as (4-96) 


There isa pole at a/J. If ais positive, the pole is in the RHP, indicating 
that the motor is liable to run away by itself under open-loop conditions. 


Torque, per cent 


Speed, per cent 


Vic, 4-8. Typical torque-versus-speed characteristics of a two-phase servomotor. 


Nolid lines, operation with zero control phase impedance; dashed lines, operation with 
large control phase impedance. 


The pole is readily removed by introducing a minor feedback loop, 
is shown in Fig. 4-9a. The feedback element may be either an a-c 
(uchometer or a friction cup, and the equivalent transfer function includ- 
ig minor feedback loop is 

Kr 
Js? + (8 — a)s adi 


I! 6 is numerically larger than @, the pole in the RHP is removed. An 
allornative is that we may leave the motor as it is and stabilize the closed- 
lnop system by means of a lead network, as shown in Fig. 4-9b. The 
luller method, however, requires precision gears. Gear backlash is 
wiulside the stabilizing loop in case a but inside the stabilizing loop in 
‘ine b, since the main loop is the stabilizing loop. It is not difficult to 
moo Why in the latter case any gear backlash would result in chatter noise. 


G(s) = 


88 SYNTHESIS OF OPTIMUM CONTROL SYSTEMS 


Amplifier 


Motor Gear 


Amplifier and 
lead network 


K(1+T,s) 
1+T,s 


(0) 


Fria. 4-9. Two methods of removing an RHP pole of the load system. 


(a) Minor 
feedback loop; (6) lead-network compensation. 


Neither case imposes a restriction on the form that the closed-loop 
transfer function C/R takes. 

The aircraft-pitch control problem furnishes an interesting illustration 
of the nonminimum-phase effect. With reference to Fig. 4-10, the 


dynamics of a tail-controlled airframe is described by the following 
equations: 


Ma = MV % = Cra — Cro 


J = Orb — Cua — Cag 


(4-98) 
(4-99) 


where the variables a, 8, y, and 6 are measured from a set of initially 
balanced values. The angles a, 8, and y are assumed to be small enough 
so that the gravitational force can be regarded as constant and absorbed 
into the initial values. The forward velocity V is also assumed constant, 
Equations (4-98) and (4-99) describe the balance of force and balance of 
moments, respectively. Their physical meaning can be understood 
from the following example: Suppose that there is a commanding inpu 
for the aircraft to accelerate upward. The acceleration error sign 
causes the controller to actuate a hydraulic transmission system which 
turn raises the elevator of the aircraft. The immediate aerodynami 
force tends, however, to push the aircraft downward as a whole, an 
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particularly to push its tail downward. This results in a slanting of the 
plane with its nose pointing up, and the plane gradually accelerates 
upward but at an oscillatory rate. Mathematically, the immediate down- 
ward force is represented by the term —C'ré in Eq. (4-98) and the turning 
moment is represented by C'ré in Eq. (4-99). The term Cra represents 
the essential lift force which accelerates the aircraft upward. 

Since a = y — 8, the linear acceleration a can be expressed in terms of 


aandg. It is 
_ yp _ _ da 
a=VG-V(a i) 


A differential equation of a in terms of 6 is obtained by eliminating a 
and q in Eqs. (4-98) to (4-100): 


a OF CL da Cu CCy = Cr d*6 
mtg +m) a+(F + yey ~~ Mt at 
Ceca 4 
- IM a at jm (CrCr — CrCu)6 (4-101) 


(4-100) 


l'rom the above equation, the transfer function of the controlled system 
is obtained: 


A(s) _ —(Cr/M)s* — (CrC,/JM)s + (CrC, — CrCu)/JM 
A(s) ss? + (C,/J + CL/MV)s + Cu/J + C,C,/JMV 


We see that a zero in the RHP is introduced by the — (C6 term which 
represents an immediate downward force following an elevator movement 
upward. The locations of the poles depend primarily on the sign of 
(w/J, which is numerically larger than C,C,/JMV in most cases. If 


(4-102) 


Nia, 410, Airframe-pitch linear acceleration system, a typical case of nonminimum 
Phane, 
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the center of force, 0’, is behind the center of mass, O, as shown in Fig. 
4-10, Car is positive and the open-loop system is stable. If the center of 
force is sufficiently ahead of the center of mass, the constant term Cu/J + 
C,C./JMV is negative, and a pole in the RHP is introduced. The 
system will not be stable without automatic control. a 

Ship and submarine steering problems are dynamically similar to the 
aircraft-pitch control problem and can be described by substantially the 
same equations. However, because of the appreciable inertia of the sur- 
rounding water, the effective mass of a ship or a submarine is larger than 
its actual mass and is different in different directions. Let M, denote the 
transverse mass and M, denote the longitudinal mass. For aircraft 
control, Eqs. (4-98) and (4-100) can be combined as 


MVq — MV = Cra — Crd (4-103) 
For ship and submarine control, the above equation becomes 


MWVq— MVS = Cia — Crd (4-104) 
Equation (4-104) will replace Eq. (4-98) while Eq. (4-99) and the kine- 
matical relation (4-100) remain unaltered. The transfer function is of 
the same form as Eq. (4-102). 


Example 4-1. For the system of Fig. 4-1, the following functions are given: 


At : fous 
®,; (jw) = Bar: Pia(jw) = we Grn = 0 
G(s) =1 


— Kine: — 8) (22 +8) 
Gils) = 8(s? + 2fwos + wo?) 


where 21, 22, ¢, and w» are positive constants. Determine the optimum system, under 
the assumption that there is no limitation on the mean-square amplitude of the 
input function to G.(s). Determine the transient responses of the optimum system 10 
step inputs in r(¢) and d(t). af 
Solution. Without power constraint, k = 0 in Eq. (4-27), and Y=Ye=1. 


P=%4- 
Z 


ll 
Es 
ll 


Equation (4-28) gives 


Pl = qa +8) 


Similarly, Eq. (4-29) gives 
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From Eqs. (4-31) and (4-32) we have 


ce aes 
BY! © 5G 
Cc ia DE 
tes ea, a bone 


The compensating network G, and feedback function H are determined from (C/R) (s) 
and (C/D)(s): 


_ (C/R)(s) _ a1 — 8 
G.(s)Gi(s) = (G/D)(s) a! sae 


ae oe 1 _ 8% + Wwos + wo? 
Cherefore, G.(s) = IG) ~ 2K Fs) 
ee! Ae Devers ah. wey = 
1= oR tage ge orn 


The system response to a step input in r(é) is 


sihebony at ie “1 {* 2 } 
Eee ie +88 - s a+s 
= 1 — 26m (4-105) 


The system response to a step input in d(é) is 


o(t) = o- { 28 IB) = ae (4-106) 


Iwquations (4-105) and (4-106) are plotted in Fig. 4-11. 
The mean-square error is 


P= ag Ji {0-2 [Ge] (-) +5) (- 2} 


= Se ea 2 ooo 
Oj pawizeie-ie@ + B*) (4-107) 


‘The significance of Eq. (4-107) is that it gives the lowest possible error even if unlimited 
power is available. This error is completely due to the nonminimum-phase character- 
intic of the load system. 


Example 4-2. In Example 4-1, assume that [/(¢)]? is limited, where i(t) is the input 
(o the fixed elements G,(s), and determine (C/R)(s) and (C/D)(s). 

Solution. From Eq. (4-27) the functions Y and Y are determined by the root- 
tquare-locus plot of Sec. 2-7. Note that Z; = —2,2, Z, = —z:% as before. Let qu, 
(1, und qs be the same closed-loop roots with negative real components as in Sec. 2-7. 
llowever, as 2, and z: are now positive, Eq. (2-55) gives 


— (= 8/qi)(1 — 8/q2)(1 — 8/qs) 
Y(s) = (1 + s/zi)(1 + 8/22) : (4-108) 


since the critical frequencies of Y(s) are in the LHP, by definition. Equations (4-28) 
and (4-31) give 


© 9) Pw (1 = 8/ts) (1 + 8/as) 
_ YP (1 = 8/q1)(1 = 8/q2)(1 = 8/qs) 


(4-109) 
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Equations (4-29) and (4-32) give 


£ 


a a (4-110) 
De 


=1 — Fy(s) 


110) would result in a system without minor feedback loop. This is 


: Pe 
a In steering systems and 


because of our assumption that yg = B?/w* and Ga(s) See Li. ; ; 
aircraft-pitch control systems, the worst type of load disturbance is a turning saben 
rather than a force applied to the center of mass. In such a case, Ga(s) is a function 0: 


s, and a different form of F'2(s) is obtained. 


r(t), d(t) 


Fiq. 4-11. Transient response of an optimum nonminimum-phase system without 
power constraint. Solid curve, system response to unit step input; dashed curve, 
system response to unit step load disturbance. 


4-8, Wiener’s Problem of Optimum Prediction. An obvious situation 
in fire-control problems is that the desired output variable c(t) is not the 
input variable r(¢) at the present but is 7(f + a) at a time a sec later, 
In Sec. 4-1, it is shown that this requirement can be represented by a pres 
dictive operator L(s) = e**. If we assume that there is no constraint due 
to controlled plant or fixed component, Eq. (4-31) gives the optimu 


filtering function as 
am 1 | eens (4-111 
r= 3(2"|, 
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To evaluate the minimum value of mean-square error, we obtain from 
Eqs. (4-19) to (4-22) 


&..(s) = ®,, + FFZZ — LF®,,, — LF,,, 
Substituting Eq. (4-111) in (4-112), the latter becomes 


(4-112) 


1 e@D,, | [e-ed,y, 
Bals) = 5 (@niban — Bride) + | ; ih ; i, (4-113) 
Equation (4-113) can be interpreted as follows: The first term repre- 
sents the error power spectrum with an optimum linear filter having no 
restriction on its physical realizability or pole locations. The second 
term ‘represents an additional error power spectrum when the condition of 
physical realizability is imposed. The proof will be given below. With- 
out consideration of physical realizability, the values of f(s) for different 
s are independent, or we may say that F(s) is an arbitrary real function 
of s. 
To find the optimum function without any constraint, we differentiate 
,.(s) with respect to F(s) and set the result to zero: 


Pie = oe 6 (4-114) 
Therefore p= oe (4-115) 
LZ 
Substituting Eq. (4-115) in (4-112), 
®,,,P. 1 
,.(s) = ®, — atom = — (PyPan — DrnPry 4-116 
(s) Bz a ( ) ( ) 


lquation (4-116) shows that the first term of Eq. (4-113) is the minimum 
error resulting from an optimum but unrealizable filter. 

Normally 4,, = 0, ®,, = ®,, and ZZ = 6,,+,,. This will be 
assumed in integrating Eq. (4-112) to obtain an explicit expression for the 
loust-mean-square error. Let R(s) be defined by 


R(s) = {&,,(s)}* (4-117) 
R(s) can be written in a partial-fraction expansion: 
R(s) = » bi (4-118) 
of SDs 
Lot B(s) be defined as 
_ &*R(s) 


1) follows from the definition of B(s) that all the poles of B(s) are in the 
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LHP, including that at infinity. Equation (4-111) gives 


b.B(p: : 
= [ROB(-Hl. = >; : fe (4-120) 


a 


With the aid of Eqs. (4-118) and (4-119), the four terms of Eq. (4-112) 
can be readily integrated: 


Pe ie 1 ib y B b,b; ds 
Qnj = r= Qnj | —« (s + pi)(—s + pi) 
= » Dee (4-121) 
Di pga? Di 


The above expression is also the mean-square value of the input r(é). 
The second term gives 


jo jo 
a i Fe-wb,,ds= =. |  (FZ)BRads 


207] —jo je 
ck 5 ae bb;BD)B) (4.499) 
= RE 


The above result is obtained by closing the contour of integration with a 
large semicircle at infinity enclosing the RHP. It is easy to see that “a 
gration of the third and fourth terms gives exactly the same result 
Combining all four terms, we obtain finally 


@ = — B(p) Bp; (4-123) 
e dares a (pi) B(p;)] 


Example 4-3. Given ®,,(jw) = A?/w?(w? + 1’) and ®,,(jw) = 1, determine the 
optimum system function which predicts r(t + @). 


Solution 
A? 4 
®r(s) = Bu =a) (4-124) 
A 
asd 3 s(s + w1) 
(s + ai)(s + a2) 
Zs) = "sie +a) 


where a, @ are given by 


i Ne 
aya = + (2 + yt - a) (4-125) 


The signs are so selected that the real components of a and ay are positive, Mquatio 
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(4-115) gives 


“ 1+Ts 
PG) = (+ s/a jC + 87a) eae 
where 7 = [1 — B(w:)]/w1, and 
Bink = Pees (4-127) 


(1 + @1/a1)(1 + w1/a2) 


The mean-square error can be calculated from Eq. (4-123). The two roots of 
R(s) are 0 and w;. Among the four terms of Eq. (4-123), the term with pi = po = 0 
gives the indeterminate form 0/0. In order to determine its value, we assume that 
Pi = p2 = «, a negligibly small but nonvanishing constant: 


A? Az 1 1 
deal — BOBO) = 4{(a+2 +2 


Equation (4-123) gives 


a2 2 


aa% 1 _ pat 
as a (a+2 + T ) (4-128) 


4-9. Optimum Filtering of Two Related Signals. When we have a 
single signal mixed with noise, any attempt to filter out the noise would 
also filter out some of the signal. This is no longer true when we have 
two independent ways of measuring the same signal. Let J, and J, be 
the outputs of two independent instruments measuring the same signal R: 


Ii(s) = Hi(s)R(s) + Ni(s) (4-129) 
I,(s) = H2(s)R(s) + N2(s) (4-130) 


Where H,(s) and H.(s) are known operators and Ny, N» are independent 
ise sources with known power spectrum. The problem is to find linear 
operators Gi(s) and G2(s) such that 


C(s) = Gi(s)1i(s) + G@2(s)I2(s) = R(s) + N2(s) (4-131) 


where N; is independent of R(s) and is to have least-mean-square value. 

‘To see how a problem of this sort comes about, let us consider a naviga- 
tion problem: The latitude of a submarine is to be determined without the 
benefit of taking a star fix. A gyro pointing north can be fairly 
wccurately maintained since it does not mutate within a 24-hr period. 
The angle between the vertical axis and the gyro axis gives a direct 
(determination of the latitude. Alternatively, the change in latitude can 
he determined by integrating the velocity or twice integrating the 
neceleration in the north-south direction. The integrated signal is more 
iecurate in giving the instantaneous variations in latitude while the gyro 
signal is more accurate in giving the long-range or averaged latitude. 
What is the best way of combining these signals to come as close as possi- 
ble to the instantaneous latitude? 
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To solve the mathematical problem as outlined above, let us note that 
G,(s) and G(s) are not independent, owing to the requirement that N3 is 


to be independent of R(s). Let F(s) be defined by 


F(s) 
H(s) 


G(s) = 


Substituting Eqs. (4-129), (4-130), and (4-132) in (4-131) and comparing 


the coefficients of R(s) give 


G2(s) = : Ta 
_ F(s)Ni(s) , [1 — F(s)|N2(s) 
and N3(s) = H,(s) + His) 


Since N, and N» are independent, Eq. (4-134) gives the following rela- 
tion between the spectral densities 411, zo, and 33 of Ni, Ne, and Nz, 


respectively : 4 : 
es FFé, , A-—F)1 — F)b2 
- A.A, AH, 


For the minimum mean-square value of n3(t), Eq. (4-37) gives 


@uF , dx(F — 1) _ y 


HH, H>He 
The solution of Eq. (4-136) is 
pu.| 2 | 
Z| AH |, 
where Z is obtained by factoring 


ZZ = 


Pu Poe 
= bene 
Ai, HH, 


Example 4-4. Given measured position and velocity signals 4 and 72 which are 


mixed with noise of spectral densities @,; and #22, respectively: 


A2 
Onis + a 

B? 
Dis se +a? 


(a) Determine G, and G2 to obtain a position signal with least noise. 
(b) Determine G, and G2 to obtain a velocity signal with least noise. 


Calculate the mean-square value of the noise in each case. 
Solution. For case a, H, = 1, H2 =s. 


ea eras Lemme at oh 
ae —s? + a! * —s*(—s? +a?) —s*(—s? + a?) 
a A(s +b) 


8(8 + a) 


OPTIMUM SYSTEMS WITH RANDOM INPUTS 97 


where b = B/A. Equation (4-137) gives 


p = 36+ [ B2 
A(s + b) LAs(s + a)(—s + 5. 
- Beto s- 1 i aoe 
(4-132) s+6 Labs aa+b)(st+a)] @ +b) +6) 


Equations (4-132), (4-133), and (4-135) give 


_ bis +a+b) 
Gi(s) = G@Ltbh@rD (4-141) 
4-133 teen. 
( ) G2(s) G@tper (4-142) 
_ _d'lw* + (a +b)? + 
(4-134) 80 = GED Pa FH Sot 


Integrating Eqs. (4-139) and (4-143) gives 


mE 
ny = 4a 
— _ A?b(2a + b) 
(4-135) ons 4a (a + b)? 
— — a. 2 
m1? — n3* = n,? € +) (4-144) 
Hquation (4-144) gives the improvement over using signal 7; alone. 
(4-136) For case b, Hi = 1/s, He = 1. 
az = A*(—s? + b?) 
—s? ~- a? 
ae A(s + b) 
(4-137) al 
Pity ick +a [ B? | a b? 
A(s +6) LA(s +a)(-s+b)], (a + 6)(s + b) 
Wquations (4-132), (4-133), and (4- i 
(4-188 Dir ik ), and (4-135) give 
Bd bs 
HO) = GFE FD woe 
_ s+ab/a+b 
G2(s) = ewe’) (4-146) 
_ B(a? + 2ab + 2b2)w? 22 
(4-139) G33 = (a + b)2(w? ee + 35) (4-147) 
(4-140) Integrating Eqs. (4-140) and (4-147) gives 
yee 2 
i) gad 4a 
—; _ B*(a + 2b) 
= — 
4(a + b)? 
Tae pe eee t 2 
nat — ng = na (“4 3 (4-148) 


Hquations (4-141), (4-142), (4-145), and (4-146) show that the best way 
of obtaining the velocity signal is not to differentiate the optimum filtered 
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positional signal. In fact, if the latter method is used, the mean-square 
value of n; turns out to be infinite. This is to be expected, since an ideal 
differentiator increases the noise power in proportion to ’. 

4-10. Summary. In Chap. 4, we have formulated in general terms the 
mathematical problem encountered in minimizing mean-square error in 
the presence of noise, disturbance, and conditions of constraints. The 
problem involves simultaneous determination of a number of unknown 
system functions. 

An optimization theorem is stated and proved. In order that the 
error integral be a minimum, the functional derivatives of the integrand 
must not possess any poles in the LHP. This is an interesting contrast 
to ordinary variational calculus which states that for an integral to be 
stationary the functional derivatives of the integrand must be zero. 

When load disturbance is present, two optimum closed-loop transfer 
functions, F; = C/R and F, = C/D, are simultaneously determined. 
These two functions in turn determine the series- and parallel-compensat- 
ing elements. When noise is present, the optimization procedure auto- 
matically specifies a noise filter as an integral part of the system. 

The presence of additional constraints can be taken into account by 
successive root-square-locus plots. It modifies the function Y(s), which 
represents system compensation. 

A nonminimum-phase system is defined as one with a zero in the RHP. 
An unstable system is defined as one with a pole in the RHP. The pole 
can easily be removed and does not constitute a limitation on the system 
response. A zero in the RHP is shown to be an essential limitation which 
cannot be removed by any means whatsoever. It restricts the forms of 
C/R and C/D and thereby imposes an intrinsic error on the system 
response. As typical illustrations, a two-phase servomotor with insuf- 
ficiently high rotor resistance is unstable under open-loop conditions. A 
tail-controlled airframe or ship represents a nonminimum-phase system, 
It may also be unstable, depending on the relative positions of center of 
force and center of mass. 

Optimum filtering and prediction of a stationary random signal with 
and without noise are treated as a special case of the general procedure 
and are illustrated by typical examples. Both the optimum system 
function and an expression for the least-mean-square error are obtained, 

While the optimum-design method is based on the assumption of 
a stationary system, its application is much wider. Whenever the 
changes in a nonstationary system are slow enough so that the prevailing, 
condition can be approximated by transfer functions and spectral densi« 
ties (which vary with time), the method may be used to advantage. ‘Th 
exact methods of analysis and synthesis of nonstationary systems t& 
handicapped by a number of factors: 
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1. The exact variations in system d i i isti 
ynamics and signal st: 
be known beforehand. zi ai 


2. The methods are numerically complicated. More often than not 
digital computers are needed. 


3. The optimum system as calculated is usually difficult to realize as 
far as hardware is concerned. 

4. Raspignces i variations in system dynamics and signal statistics 
are slow in terms of system response ti i 
ves si at ponse time, the advantages to be gained by 

For the above reasons, what we may do for slowly varying nonstation- 
ary systems is to design a system which would be optimum if the prevail- 
ing condition were stationary. A number of constants in the controller 
depend on the spectral densities and transfer functions, which represent 
the prevailing condition, and are adjusted with an auxiliary servo to the 
optimum setting from time to time as the latter varies. A system of this 
type is known as an adaptive system. Alternatively, we may optimize 
the system for the heaviest load or worst condition. In other words 
the spectral densities corresponding to the worst conditions are vised. 
Another alternative is to optimize on a weighted-average basis, by ake 
the weighted-average spectral densities. Obviously, sombsnaueny of 

such techniques can also be used if the nature of the problem so dictates. 


CHAPTER 5 


NONSTATIONARY SYSTEMS AND SYSTEMS 
WITH LIMITED OBSERVATION TIME 


In the present chapter, we shall take up the problem of optimum design 
of nonstationary systems and systems with limited observation time. 
While analysis in the s domain is simple and closely related to traditional 
servo theory, the method is not applicable to nonstationary systems and 
systems with limited observation time. The time-domain treatment of 
this chapter will be put in more general terms to include the above- 
mentioned systems. 

We shall start by generalizing the definitions of correlation function and 
impulse-response function, so that nonstationary signals and time- 
dependent systems can be described. An integral equation for the 
impulse-response function of an optimum system is derived for the gen- 
eral case. Reduction to special cases and methods of solution of the 
integral equation are then given. 

5-1. Nonstationary Systems. Nonstationary systems can be classi- 
fied into the following types: 


1. Systems with time-varying fixed component 

2. Systems with nonstationary inputs 

3. Systems with both time-varying fixed component and nonstationary 
inputs 


As examples of the above types, let us consider first the missile-control 


problem: The fixed component, which is synonymous with the controlled- — 


system dynamics, varies because of the diminishing weight of the fuel, 
the changing speed of the missile, and the changing density, temperature, 
composition, etc., of the surrounding atmosphere. If the disturbing 
forces and moments due to atmospheric turbulence are negligible or are 
stationary and if the statistical nature of the instructions to the con 
trolling elements does not change, then the inputs are stationary and the 
system belongs to type 1. Otherwise, the inputs are also nonstationary 
and the system belongs to type 3. Fire-control systems are usually of 
type 2. Another example of type 2 is encountered at the initial seconds 
of applying an input signal to a system. Since the stochastic inputs do 
100 
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not go back to infinity, the statistics of the system output are not the 
same as those which exist after the input signals have been applied for 
some time. 

In order to obtain optimum control in the least-square-error sense, the 
controlling elements also have to be time-dependent. Therefore, the 
response of the total system must vary with time whenever the above- 
mentioned situations arise. 

5-2. Mathematical Representation of Time-varying Linear Systems. 
A linear system may be defined as one that obeys the superposition 


v(t) 5: 
Sum signal of the 
response pulses 


Response pulses 


i(t,) dtg(t,t,) 


I'iq, 5-1. Response of a linear system can be approximated by the sum of impulse 
responses to input at various times. 


theorem. Its characteristics are completely specified by the impulse- 
response function g(¢,t:) which represents the system output at time ¢ due 
(o a unit impulse input at time é;. 

An arbitrary input function i(é) can be approximated to as close a 
‘logree as desired by a sum of impulse functions of magnitude i(t;) ét, 
(ty) Ol, . . . , at time t), t, . . . , as shown in Fig. 5-1, by making the 
interval 6¢ arbitrarily small. Owing to the superposition theorem, the 
system output v(¢) is the sum of responses to these impulse functions. 
‘Therefore 

v(t) = lim Y [i(tn) dé]g(t,tn) 
bt-0 


n 


= fess g(tyti)i(ts) dey (5-1) 


ae 
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A physical system is incapable of responding to inputs which are not 
yet applied, and the upper limit of the integral is t. However, in a hypo- 
thetical system representing a desired performance (e.g., prediction), 
this restriction is not always valid. Equation (5-1) can be written alter- 


natively as 


o(i) = [7 a(tti(h) dtr (5-2) 


The incapability of a physically realizable system to respond to future 
inputs can be expressed as 
g(t,ti) = 0 fort; >t (5-3) 


For a linear system that does not change with time, the system response 
to an impulse function depends only on the time interval between the 
application of the input impulse and the time of observation. It may 
be described as invariant under time translation. Instead of writing 
g(t,ti) we may write g(t — 4). Equation (5-1) becomes 


o() = fi, 9 — Wilh) dts = fy” site — 2) ar (5-4) 


The second equality sign is obtained by substituting 7 for ¢ — h. Physi- 
cally, 7 is the interval that has elapsed since application of the input. 
Equation (5-4) is known as a convolution integral in the time domain. 
We note that Eq. (5-4) holds if the system satisfies two conditions: 


1. Linear 
9. Invariant under time translation 


If condition 2 fails, Eq. (5-4) fails also; however, Eq. (5-1) still holds. 
The Laplace transform of Eq. (5-4) is 
vis) = f, [J o@ie — 9) ar at 
= {fF fe e-tg(n)e-*-Pi(t — 7) de dt = G(s) I(s) 
The system can be represented by a transfer function in the s domain, 
Physically, the system is composed of constant elements. However, with 
an impulse-response function g(t,t1) which depends independently on ¢ 
and t;, the above transformation is not possible. The function (tt) 


represents a system with time-dependent elements, or varying parameters, 
The correlation function of two random signals is defined, in general, i 


gix(tite) = (iy (tr)ta(te)) (5-5) 


where the pointed brackets represent the ensemble average. For st 
tionary signals, pi2(ti,t2) depends on the interval 7 = t: — t only. Wquar 
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tion (5-5) becomes 

git) = (ti(ts)e2(tr + 7)) (5-6) 

From Eq. (5-5), it is trivial to verify the following symmetry properties: 


Pir(ti,te) = drr(to,t1) 
diz(ti,to) = gor(te,t1) Kate 


Another important relation expresses the correlation functions of th 
output variables in terms of the inputs. In Fig. 5-2, the correlation f s 
tion Pirie (tr,t2) is known and the correlation tagations of the output a 
ables v(t) and ve(é) are to be determined. From Eq. (5-2) St 


v1(t)v2(t’) = ie gi(t,ts)2(t1) dt, to, go(t' ,t2)¢(t2) dts 

= ries i gi(t,ti) go(t' ,t2)21(t1)¢2(te) dt, dt, 

‘Taking the ensemble average of the above equation results in 
dovsltt) = [7 [7 ailtstrdgalt te) diisltasts) dts dts (5-8) 


[quation (5-8) i ; 
pit: (5-8) is good whether gi(t,é:) and go(t,t1) are physically realizable 


i, (t) 

7 an 

hd g,(t,t,) va(t) h 
ra ——>-—-4 Alt,t,) KL» v(2) 


Nia, 5-2. Relation betwe i 

Pitiok , ’ en correlation Fia. 5-3. Desired s 

functions of nonstationary input and out- and actual system pee he ol 
pul signals to and from a time-varying iat 

linear system [Eq. (5-8)]. 


| 
| 
Y bi e(t) 
I 
i 


ce Dese-epiace Optimization of a Linear Nonstationary System. 
igure 5-3 illustrates the criteria by which the system function g(t,t,) 
in Lo be determined. The desired output v(t) is obtained by a tie 
linear Operation, with impulse response A(t,tr), on r(t). The sent in i 
in masked by additive random noise n(t). The error e(é) is Hea nes 
v(t) = e(4), and the ensemble mean-square value (e?(t)) is to be aintinixed 
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for every instant of time. By definition of e(t), 
Pee(t,t) = dov(t,t) <i Pec(t,t) as Whev(t,t) (5-9) 


We note that ¢».(é,t) is independent of g(¢,t1), and the et 
dbec(t,t) and dev(t,t) on g(t,t:) can be exhibited by making use of Eq. (5-8) : 


baltt) = belt) + [7 [7 alttdaGite) brn (tuts) dtr dle 
=e | J ‘ ”  a(tyts)h (Lyte) drir(tr,ta) dtr dtz (5-10) 


Let go(t,t:) denote the optimum system function and Pont (t) denote the 
inkawam value of ¢c(t,t) which is obtained by substituting go(t,tr) in 
Eq. (5-10). Then an arbitrary system functin g(é,t:) can be written as 


g(tyts) = go(t,tr) + egi(t,ts) (5-11) 


The condition of physical realizability requires Eq. (5-3) to hold a 
three functions, g(t,t1), go(t,tx), and gi(t,t:). We require further tha 
system have finite memory T(¢): 


g(tyts) = go(ttr) = gilttr) = 0 
Equation (5-12) does not restrict our solution but rather generalizes a 
since we can always apply the solution to the case of infinite nn on 
making 7'(t) approach infinity for allé. Substituting Eq. (5-11) 7 . y 
rearranging terms, and writing Eqs. (5-3) and (5-12) into the limits o 
integration give 
Gealtst) = dont) +. ff, guidsta)grltste) bran (tata) dla dts 
+ 2¢ f' f gultsts)goltsts) ron (tists) dts dts 
t’ U 


i Dy i iat l wea galtyts)h(Lyts) drir(trjea) dtr dt2 (5-18) 


=t! 


forallt; <t— T(t) (5-12) 


where t’ = t — T(t). Equation (5-18) is obtained by making us 
(5-7). Since the second term on the right-hand side is ase 2 “a 
sufficient condition for making go(t,t:) the optimum system une a ; 
that the net value of the last two terms vanish for arbitrary gi(t,li) 


Therefore 


[i goltsts) nn (taste) dt, = fit A(tyte) Prr(lrjle) dee 
7?’ 


Equation (5-14) is also a necessary condition, since the second io 
the order of e? and the last two terms are of the order of « =H *: ve 
does not hold, it is always possible to make ¢ small enough so t _ 

second term is insignificant and to choose the sign of esuch that oo(tt) 


t'<t,<t (5-14) 
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Gopt(t). This clearly contradicts the proposition that go(t,ti) is the 
system function that minimizes the ensemble average of [e(¢)]?. 

Both h(i,t2) and ¢,,r(t1,t2) are known functions, and the right-hand side 
of Eq. (5-14) can be integrated. The result is readily seen from Kq. 
(5-8) to be the correlation function ¢,,.(t1,t).. Equation (5-14) becomes 


1 Grinllagts)go(tyts) de = dre(tr,t) (5-15) 


Equation (5-15) was derived by Booton. 

For every value of t, $,,,,(t1,t2) and ¢,,»(t1,¢) are known functions, and 
tand ¢’ are known constants. The function go(¢,t2) is to be determined as 
a function of tf. Equation (5-15) is an example of Fredholm’s equation 
of the first kind, for which no general solution is known. However, if 
ri(t) can be represented as the output of a linear differential operator 
with time-dependent coefficients operating on white noise, Eq. (5-15) can 
be solved by a method devised by Miller and Zadeh. The steps are rather 
involved even for a single value of t. Their solution is useful for research 
purposes to gauge the closeness of any approximate solution to their 
exact solution in extremely simple or extremely important cases. It is 
too complicated to be used for ordinary work. 

The function ¢,,+,(t1,2) is called the kernel of the equation. It deter- 
mines the complexity in arriving at a solution. In the present case, 
the kernel is symmetrical, since ¢,,,,(t1,t2) = rr, (toyt1). 

5-4, Filtering or Prediction of Deterministic Inputs in Random Noise 
with Finite Observation Time. In fire-control and inertial-guidance 
systems, one significant problem is the recovery of a deterministic signal 
that is masked by random noise. The desired signal can be described 
as a linear combination of a set of known signals with unknown coef- 
licients. For instance, the bearing of an airplane can be closely approxi- 
mated by a low-order polynomial in time t, except when it suddenly 
switches course. We should like to be able to predict its course as closely 
us possible, not counting the sudden switches, and to do this in as short a 
lime as possible. Obviously, if the observation time is too short, we 
ounnot distinguish signal from noise. If the required observation time is 
(oo long, the system becomes impractical; furthermore, it would take 
into consideration irrelevant data as well as relevant data. Another 
example is optimum filtering: The error of an inertial reference system 
hus both a random component and a coherent or sinusoidal component of 
oxtremely long period. While the former is unavoidable, the latter can 
lw eliminated by accurate adjustment, which depends, of course, on the 
mnsitivity and reliability of detecting such coherent components. The 
observation time must be considered limited because of the long period of 
(he sinusoidal error, 
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To attack this problem, we shall start by modifying the results of the 
io) ) 


Vv i have 
revious section The input signal to the s stem 1S assumed to 


three components: 


1. Random noise n(t) 
ignal r(¢ | 
A Ee wink ? a linear combination of known signals fi(t) with 


unknown coefficients Ai, namely, 


~1 
E(t) = ¥ Ahi (5-16) 
inputs r(f) + n(t). 
denotes the sum of random input 
lea the impulse-response function g(é,ts) to satisfy the 


conditions that 


ff eadoten) dtr = 7, EERO) dh (5-17) 


i t) as 
and that the mean-square error dee(t,t) due to random inputs 11(¢) 

j Eq. (5-10) be a minimum. . al 
Be ei ae that the system gives exactly a ge ‘— | 
poker on £(t). Owing to the arbitrariness of A; in hq. (9-19); 
(5-17) is equivalent to 


i j= ..,N (6-18) 
[/saedotu) dts = Jl, fib) dh t= 1,2,---5 
ze 
i . There 
For any t, the right-hand side of Eq. (6-18) is a pide ee j con 
are infinite points on the line segment t’ < ti = : ieee ee a 
tion is disregarded, corresponding to each poin we aCe 
there is an independent value of g(t,t1)- For : e in a 
alues of g(t,t:), only N linear equations exist. onseq Sy ie 
e infinite puniber of solutions which satisfy Eq. hehe Pe 
the one that yields a minimum value ¢ec(t,t) is the op 
ae adeeb Eq. (5-11) as before, we obtain Eq. (5-13) together with 
N equations arising out of Eq. (5-18): 


j= N (5-19) 
ef, fade) dh=0 4 =1 Bae 


Eq uation 5-19) can ipli by 8 f Lagrang Y's multipliers 
i be multiplied a set 0 e 4. 
2n (t) 1 d Aen! to K Q, ( )- 1 3) As mentioned before, we oh deter aol 

i a at ") J 4 . 4 . x a : 
i A different sel of values 
optimt m go(t t for each given value of t. . ad ich va 
iit \ Y 8 8 exp! SC b 


A(t). 
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Equation (5-15) becomes 


i=N 


Jy 9olbst2)drin(lats) dts = droltsh) + Y MOfc) (6-20) 
i=1 
The solution of Eq. (5-20) can be expressed as 
N 
golt,ts) = goolt,ts) + Y) re(t)goi(t,tr) (5-21) 
t=1 


where goo(t,t2) satisfies Eq. (5-15), and go;(t,t) satisfies the following: 


Jy Goiltstz)res(tiyte) die = fir) %=1,2,...,N <u <t 


(5-22) 
The functions \(t) are determined by substituting Eq. (5-21) in Eq. 
(5-18). 

It is interesting to note that the set of integral equations (5-22) is of 
exactly the same form as Eq. (5-15). These equations are usually diffi- 
cult to solve. However, there are special cases which are practically 
significant and mathematically manageable. These will be taken up in 
the following sections. 

5-5. Time-invariant System with Finite Observation Interval. As 
shown in the previous section, the general solution go(t,t:) represents an 
impulse-response function having a waveform depending on the exact 
time of applying the input. The optimum system is not only difficult to 
instrument, it is also difficult to use, because the problem of synchronizing 
the system to the observed object is not always trivial. Under reasonable 
assumptions of the nature of the input, however, the solution is reduced to 
time-invariant form, that is, go(¢,t:) is a function of t — ¢; only. The 

above mentioned practical difficulties are then alleviated. 

The assumptions are as follows: 

|. The random inputs r(é) and n(é) are stationary. 

The desired linear operation does not change with time, h(t — t,). 
The allowed interval of observation 7’ is independent of t. 
The set of known functions f,(t) satisfy the following equations: 
N 


ie+r) = Y af i1=1,2,...,N 


j=1 


2. 
3. 
4, 


(5-23) 


‘To demonstrate that condition 4 is not too stringent, we note that it is 
sitistied by the following functions: 


|, Ixponential function e% 
2, The set of sine and cosine functions, sin wt and cos wt 


4, ‘The set of powers in t, from zeroth bole D) shield, dy esas a stNat 
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On the other hand, Eq. (5-23) is not satisfied by most other functions, 
for instance, any set of functions including 1/t. 
If it is assumed that conditions 1 to Aare satisfied, Eq. (5-18) becomes 


[ip fledoe) at, =f" fhe) dh $= 1,2,...,N (6-24) 


Letti =i: +t—T7. Using Eq. (5-23), Eq. (5-24) becomes 


¥ ast = 1) fy" Heda + 4 — 7) ds 


P] 


= y, aul i [sont sy eee eee 


A sufficient condition for satisfying the above equation is 


J," Hiatt, & + tT) dh = [L,Stan(r — ts) dt 
5=1,2,...,N (5-25) 


Equation (5-25) is also the necessary condition, provided that the deter- 
minant of a,; does not vanish. The latter condition is always satisfied. 
Similarly, by the substitution t12 = tsa +t — T, Eq. (5-10) becomes 


butts) = on0) + ff, ete tt— Tote +t— 7 
brr(ta — ta) dts dt, — 2 ee pale g(t, ts +b T) 


= 


h(T — ta) brir(ta — tz) dts dts (5-26) 


We note that ¢; and ¢ are dummy variables and can be switched to what 
ever symbols we like. The unknown function g(t, ts + ¢ — T) is required 
to satisfy Eq. (5-25) and to yield a minimum value for the right-hand side 
of Eq. (5-26). Since none of the other functions and parameters involved 
depend on #, the solution is obviously independent of t. The only possible 
solution is then 


g(t,t) = g(t — hh) (5-27) 


which implies that g(t, ts + ! — T) = g(T — 4s). This proves our propo» 
sition that, if conditions 1 to 4 are assumed, the optimum system is invarl« 
ant with time. Stated in another way, if a system is shown to be opti« 
mum at the end of the first T sec, it continues to be optimum thereafter, 

In the equations, d,(¢) are no longer dependent on t. In Eqs. (5-15 
and (5-22), we may set t= T. Letras 7 — te Wquations (5-15) 
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1 
(5-22), and (5-21) become ik 


ig 
I Joo(T2) Prin(T1 — T2) drs = dro(71) 


: OVS ST 6228) 
if Goi(T2) brn (71 — 72) dro = f(T — 73) 


ll 


. O<n<T (5-29) 
go(r) = goo(r) + a Agoi(7) (5-30) 


The N coefficients ), ar i 
; are determined by substituti 
) ng Eq. (5- i 
following N equations, which are obtained from E (6-35 ‘" a at 
I F ( T °° “ 
» Gor f(T — 2) dr =f" Aef(T —7)dr 6 = 1,2, N 
es 
; . (5-31) 
Bac ae bears a = noe Pate unknown coefficients a) and a, is 
: Na pubiets » Pnal(t) = 7): etermi i i 
unerring prediction for a sec ahead, if the noise Sear bale nugslie fs h sey 
east- 


square error in general. The observation time is 7’ sec 
square error for the system. 


Solution. In terms of the previous notation, we have 


Determine also the mean- 


A(z) ‘ak OG + a) dryr(7) =0 
lquation (5-28) gives 4 fot) =t 


Wquation (5-29) gives 1 lai 


bY 
I Bgoi(r2)8(r1 — 72) dr2 = Bgoi(rs) = f(T — m1) 
Therefore, 


oa 1 
91 = B = Grr) = F(T —7) 


At LN i 
go(7) = B oir 5 t=) ba 
Substituting (5-32) in (5-31), we obtain 
2 
Tr, + z 2 = B 
hes T? 
aM +3 rs = B(T + a) 
Nolving the above equations for \; and dz, 
B 
M=-—-=F = 
1= -7(2 +64) 
B 
de = (6 + 12 
al “ 
wir) = 7[(6 +124) (1-5) -(2+68)] 
ach a 
p[(4+6¢) -(6+124)5] (5-33) 
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radar uncertainty is equi 
quivalent to letting A infini 
(4-126) gives the optimum system radia di ace” spanish 


Equation (5-26) gives 

= T 4B a a? 

= i [go(7) 2 dr = - (1 +3 T +3 a) (5-34) 
F Ba F(s) = 

Equation (5-33) shows that the optimum impulse-response function is independent (s) 1+ Tis 

of B which represents the noise power per unit bandwidth, and Eq. (5-34) shows that 
the mean-square error is proportional to B. Figure 5-4 gives a plot of g(r) as & func- 
tion of r for a/T = 0 and 14. We note that, as a is increased, the area representing 


the response of reversed polarity is also increased. 


oki 
oi ogee | Wad tie RR 


Alternatively, we may use the + 
eel : : mba sah ochaa shea dae A Rta Se, Sy letin:. <p Peabiems 
There is a significant difference 1n basic concept between the optimum- polynomial of high order to approxi , ol course, possible to use a 
design method presented here and the one presented in Chap. 4. We proximate not only the straight path 
recognize here that the input has a 
coherent component &(t) and ran- 


g(r) 
L dom components r(t) and n(t). 


$ While the least-square criterion 1s 
al i the best for the random compo- 
T nents, it may not be so for the 
2 coherent component. In fire-con- 


trol problems, 2 miss may as well 
be a mile. It is more desirable to 
have a tolerable error for the largest 
fraction of time than a least-mean- 
square error over all times. This 
concept was introduced by Zadeh 


} ae 
Fra. 5-4. Impulse-response function g(r) and Ragazzin}. 

of an optimum system in the presence of To see what the result means, let 
white noise for the prediction of ao + ait US consider the problem of shooting 
i with finite observation time igs at a zigzagging airplane in its sim- 
plest version. We assume that the 
plane has a constant-velocity component in the x direction, changing» 
velocity component in the y direction with uniform distribution between 
+V, and switching at random intervals, the averaged value of which is 
1%. The prediction time a is the interval required for the projectile to 
reach the plane, and our problem is to predict the transverse position 
y(t + a) correctly. It is further assumed that the radar introduces & 
positional uncertainty with uniform power spectrum but very low-mean= 
square amplitude. We shall compare the “optimum” systems accordingt 
to the two different concepts. 

For strictly least-square-error criteria, the solution is given in See 


| 4-8. The spectral density (jw) is 


ye Rim oOo 8 


lia. 5-5. Compari 

5-5. parison of a system witl 

errorless constraint for apie wenden ss orem 

roves | ie juare pre ion (a) and a syst i 

variable; broken line, predicted See CATR Prey een rae ine ata 
: ? 


hetween two switchings but al 
abi elit ae iso a longer path includin itchi 
_ ar eben neal because as soon as we ue ee 
Sa at or powers of ¢ in the predictable set ni fer ee 
ghee ot oise or radar uncertainty becomes aniclerabl la ae 
saad hE Ag se appears to be the least ambitious one We hall 
e plane's y position by a linear function of t If Rig cvs 
: e 


aw i ( ches cours € i 

The r olut io i 5 b fore the arrival of the projectile it has to be a miss 
; : J. Is #1 en 1n Example 5-1 A t ; 
a, a Nn 1S £1v . s he radar uncertainty is very 


is permissi 
permissible to use a short observation interval 7 


| Ireure ved ill , r , L , i) 
bs) 5 e situeé tion with somewhat exaggerated error 
| he He tual y(t) as a function of t 18 shown by a solid line. In I 1g. 5-5a, 


2 
,, (jo) -- w?(w? + v*) 


which is the same as Bq, (4-124) witho,; =». The condition of negligib 


—————— OO — 
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the broken line represents the predicted y(t) of a system with strictly 
least-mean-square error. The fact that 71 < @ gives insufficient antici- 
pation of the subsequent change. In Fig. 5-5, the broken line represents 
the predicted y(¢) of the system of Example 5-1. We see that, in the first 
a sec after switching to a new course, the error is larger in b than in a. 
However, in the remaining intervals the predicted y(t) of Fig. 5-5b coin- 
cides with the actual y(é). While Fig. 5-5a gives the least-mean-square 
error, b represents a much greater probability of bagging the target plane. 
Where the effect of noise is considered, the predicted y(t) is distributed 
about the broken line, with the broken line as mean value in both cases. 
The above observation still holds. 

5-6. General Solution of a Time-invariant System. Example 5-1 


represents an extremely simple case with its ke 


Normally the solution is much more involved. We shall discuss & 
general solution under the assumption that ®,,7,(8) 18 & rational function 


of s. 

(5-29). Both equations are of the form 
T: 

[pao — 7) dr = FO 


and g(t) = 0 fort < T;,andt > T2. 


We know also that ¢(¢) is an even function and that its Laplace trans- 
form &(s) is a rational function of s. Since ¢(¢) is even, ®(s) is a function 
of s? and can be expressed as N(s)/D(s), where both N(s) and D(s) are 


real polynomials of s’. First, we shall consider the simpler case. 


Case 1. N(s) = 1, &(8) = 1/D(s). Let p denote the operator d/dt. 


Operating D(p) on Eq. (5-36) results in 


TM 

What is this function D(p)o(t — 7)? Its Laplace transform is 
D(s)®(s)e" = &* 

Therefore 


When Eq. (5-38) is substituted in (5-37), the result is 
g(t) = D(p)/fO 


At the two ends of the integral, t = 71 and t = 72, the 6 function and | 
derivatives possibly exist. It will be shown that the 6 functions are, 
general, necessary. Let the roots of the polynomial D(p) be deno 


+b, 62 1,2,-65.% where 2m is the order of D(p) in p, and ther 1 


components of the b/s are negative by definition, We assume that bi) # 


rel being a 5 function. 


The most difficult hurdle is to solve the integral equations (5-28) and 


T,<t<Ts (5-36) 


[Zr Doe — Ndr = DOO (5-37) 


D(p)o(t — 7) =  — 7) (5-38) 


T,<t<Ts (5-39) 
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fc ; : 
t #j. Then, for arbitrary coefficients A; and B; we have 


D(p) [fF + » Axe + ¥ Be] = D(p)f() (5-40) 


From Eq. (5- : 

nettles - si bi tan that different f(t) can lead to the same 

bowadiedrnee ae . ds On the other hand, if the values of a 
1 and t = 7», the integral on the left-hand side of Ba, 


(5-36) gives a uni : 
quely determined f(t : Age 
ated by allowing 6 functions at oe ie pegs ae 


a) = DOO + ¥ ast - ae 
> 6O(¢ — Ty) + » B6(T, — 1) (5-41) 
From Eq. (8-11), 
¢(t — 1) = » Cierilt—r1 (5-42) 


h 7 Ww 
where the C;’s are known constants. From Eqs. (5-41) and (5-42) 
a ) 


m 


Jr. 9) — 1) dr = fl) + YG 


t=1 j 


i bs C; bs B;(b:)4e%(P2—6) (5-43) 
i= a 


a; (b;) 408-7) 


ibs 


int i i 
Mh ; b egrating g(r \oct T) without consider ing 


F(t) = folt) + J Ajee-™ + YY Bieber (5-44) 


Comparing Eqs. (5-43) and (5-44), 


Ms 


j b; i i= 4 = 
Ao a;(bi)°C At a= 1,2, ,m 
Boece ta 5-45 
>» Bi(dMCs a B; 1 Pine i, 2, eee} m 


Mince the m A”’s ar i 
, i e arbitrary, at least m a,’ ‘ 
(0-15), The same holds for the Oi ere ee 


n>m-—1 n’'>m—1 (5-46) 
ay Md 
® summarize the above: The solution of Eq. (5-36) is given by E 
c y Eq. 


(hed 1), ituti 
vil), By substituting D(p)f(t) for g(t) in Eq. (5-36) and integrating 


(¢) is obtai : i 
o obtained, By comparing fo(¢) and f(0), the coefficients A; and B 


__ ___ "dee 
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are determined. The coefficients a; and B; i eases 115 
: a; and B; are then determined from It follows from Eq. (5-49) that 

Eq. (5-45). 

If Eq. (5-36) is the only condition for determining g(t), the equality D(p)filt) = D(p)fr(t) = 0 (5-51 
signs of (5-46) may or may not hold. In the latter event, a;’s and ;’s lim f.(t) = 0 -51) 
are not uniquel determined. However, this ambiguity can be removed er 

aery ed lim f.(t) = 0 (5-52) 


by a physical consideration: In Fig. 5-3, the mean-square value of the 
output function c(t) is 


4200) = fp’ fo, 9O9 bank = 7) A ar (5-47) 


Let x(t) be defined as follows: 


. my Tist<T, 

x = 1(t) b= T, 

x(t) = felt) i> 0s i 
The integral equation 


It is convenient to discuss the convergence of the above integral in terms 
of Laplace-transformed functions: ®,,,(8) is of the order 1/s”” for large 
value of s. ®cc(s) is of the order of 

1 1 


ee g2N ee 
g?m g2(m—N) 


i: g(r) o(t — 7) dr = x(t) —0 <t<o (5-54) 


is then equivalent to Eq. (5-36), as will be shown presently. Since the 


limits are extended to infini 
nit jouiti ars, 
mR eres y, the ambiguities at the limits no longer 


g(t) = D(p)x(d) 5-55 
ISquations (5-51), (5-53), and (5-55) give 


where N is the larger one of n,n’. In order that ®.-(0) be bounded, ®ce(8) 
must be of the order of 1 /s? or smaller as $ approaches infinity. Therefore 


n<m-—l1 n<m-1 (5-48) 


Considering inequalities (5-46) and (5-48), we have 


n=n=m-1 Therefore, Eq. (5-54) is reduced to 
There is only one solution for g(t) that satisfies the above and is therefore 
physically feasible. 

While g(t) can be obtained by the brute-force method outlined above, 
the computations are rather tedious. A much faster method will now be 
derived: The anomalous situation ati = Ti andt = T. can be removed 
by extending the range of integration. Equation (5-36) holds only for 


|r. 9) $(t = 1) dr = 2) (5-56) 


We note that, for the 
aos range 7; <t < T2, Eq. (5-56) is the same as Eq. 


In the neighborhood of ¢ = 7;, Eq. (5-55) can be written as 


) the interval Ti <t < T2. Whatever value f(é) may take outside #) =' Dt = 

range has no effect on g(t). Let us define f(t) and f2(¢) as follows: ‘ e I ) (p) D~(p) x(t) (5-57) 
| sigs 3a ole eA 8a T 

| f= (p—) = VA I (Deatet0s) 


| m 

| f= Y Ce™ 

\ 1 » the real components of the b,’s aren = x 
in D(p). egative, and A is the coefficient of p’” 


Lot us determine the function 


fa) = Y, Cie 
i=1 


y(t) = D-(p)x(0) (5-58) 
Irom qs. (5-49) and (5-53), 


The constants C; and Cj are selected to match the function f(t) and i 
first m — 1 derivatives at the two ends: 


f(T) = f(T) *\ (ein 


i= 0,1, “ 
f(T) = LECT) ¢=0,1,2,6++.™> 1 y(t) = 0 i< 7, 


VW) = DPQ t>T% (5-59) 


——————————<< Te 
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From Eg. (5-50), (t) and its derivatives up to the (m — 1)st order are 
continuous at t = Ti. Since z(é) is differentiated, at most, m times, 
there can be a step function but not a 6 function at ¢ = 0. Considering 
Eq. (5-59), we have 


y(t) = ut — Tr) D-(p)f) (5-60) 
g(t) = D*(p)[ut — Ts) D-(@)FO! (5-61) 


where u(t) is the unit step function. Similarly, in the neighborhood of 
‘= T 2, 


g(t) = D-(p)(u(T2 — )D*(p)fO) (5-62) 


Equations (5-61) and (5-62) give the 6 functions at t = T1 andt = T2 
directly from D(p) and f@. 

Case 2. Both N(s) and D(s) Are Real Polynomials of s’. Let us 
denote the roots of N(s) by a,7=1,2,---, 2n, where 2n is the order 
of the polynomial N(s). From the fact that $-c(0) must be bounded for 
any physical system, we obtain the following conditions: 

1. The order of the derivatives of the 6 function att = 7, andt = Ts 
must not be greater than m — n — i. 

2. If m — n = 1, only 6 functions are allowed. 

3 Ifn=m+r, theng®(Ti) = g®(T:) = 0,4 = 0,1, .- - ot — i 

These conditions can be realized as follows: Let us consider the integral 


equation 
_ = f () v at — 
[zr atr)outt — 2) ae = HGpy + p) Ken =f) 6-68) 
en ee 
where gr(t) = & 5 all 


is an even function of t. The constants K; are selected so that if 
define 
y(t) = D-(p)fr(t) 
y(t) = D*(p)fr® 
then yiO(Tr) = y2O(Tr)) =0 += 0,1,2,..-507 1 


Equation (5-63) is obviously of the form discussed in the 
section. Its solution is given as 


g(t) = D(p)fr® Pt oT 
= D+(pyut— Tim) t= Ts 
= D-(p)u(T2 — yl) t= Tr 


Owing to Eq. (5-65), the boundary conditions at the ends are satis 
Operating on Eq. (5-63) by N(p), we obtain Iq. (5-36). Therefore 
of Bq. (5-66) is a solution of Wq. (5-86), 
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Example 5-2. The problem is the same as Ex 
5 ; ample 5-1 except f : 
onn(t) * Be-‘l7|, where b is a positive real constant. ae one 
Solution. The Laplace transform of ¢nn(z7) is 
@n,(s) = . dr Be~*I"le-s7 = _2bB (5-67) 
Equation (5-28) gives 


: Joo(r ) =0 
Equation (5-29) gives 


r 
I, goi(r2) onn(71 — 72) dt2 = 1 


T 
i. go2(t2)onn(t1 — 72) drg = T — 11 


Dini oe ke 
(p) Be 9 (p + 6) 
D> = ee me 
(p) Va 208 (p — b) 
quations (5-39), (5-61), and (5-62) give 
1 
guilr) = ap lb + (7) + A(T —7)] (5-68) 
1 
Jorx(r) = bB (b(7 —7r) + +07) 67) — 4(T —7)) (5-69) 
g(r) = Argor(r) + Azgo2(7) (5-70) 


Wquation (5-31) gives 
r iN eT 
r fy 90) @ = 35 OT +2) + MEO TD a1 
Es a Me 
[, o()(T — 7) dr = Fe (OT +2) + oe OT? + 86T + 3) = T +a 


Holving the above equations, we obtain 


Ar _ 12 — 2677? — 6bT(2 + bT)a/T 


2B” (2 + 6T)(2 + 6bT + b?7) (5-71) 
nee» 6 + 120/T 
20B «12 + 6b7 + b°T? (5-72) 


Mubstitution of Eqs. (5- m ns ' 
1 of Eqs. (5-68), (5-69), (5-71), and (5-72) in (5-70) gives the optimum 


If the noise bandwidth b is sufficiently wide, 67 approaches infinity 
nnd Iq. (5-70) gives the same result as Eq. (5-33). On the other haridh 
if the noise is of very low frequency, 6 approaches zero, and we have 


1 
ate) = 5 (0) + a7 ~ 1 + (5 +9) foe) — aC = 2) 


= 8(r) + 7 [0(r) = (T=7] (5-73) 
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Equation (5-73) can be interpreted as taking the readings at the present 
and at T sec ago and projecting the result linearly to a sec later. Obvi- 
ously, we shall not be able to filter out the low-frequency noise by such a 
procedure. However, as the system is required to duplicate a constant 
signal faithfully and as a low-frequency noise for a short interval T < 1/b 
cannot be distinguished from a constant signal, filtering is not possible. 

As a numerical example, let bT = 2, and a/T = 4; Eqs. (5-71) and 
(5-72) give 

M1 3 


‘hs oh A» _ 10 
ne... 


20B (28 
Equation (5-70) gives 


17 20g ee 


13 
g(r) = “Tt ~ 4? 4p 38 (7) — 58 a(T — 7) (5-74) 


Equations (5-73) and (5-74) are plotted in Fig. 5-6. Figures 5-4 and 
5-6a and b illustrate the step-by-step change in the optimum form of g(r) 
as the noise bandwidth b is decreased. 


To co To co 
1 
i] 


aint 


27 
B 6(7) 


0 T 7 
1 55( T-1) 
To oo 


(b) 


Fic. 5-6. Impulse-response function g(r) of an optimum system in the presence of 
band-limited noise. (a) bT = 2, e/T = ¥%; (b) bT = 0, a/T = . 


5-7. System with Finite Observation Time and Power Constraint Due 
to Fixed Output Element. Consider the problem of Secs. 5-3 to 5-6 
with the following modification: Instead of producing an idealized output 
c(t), which represents the best estimate of r(t + 7), and applying this sige 
nal to a separately designed servo follow-up system, the over-all system, 
including the power component, is designed as a whole. We shall limit 
our study to time-invariant systems. Let c(t) represent the output of 
the power component, 7(¢) the input to the power component, and g(r) the 
impulse-response function of the over-all system, In addition to 4 
requirements that dee(0) be a minimum and Eq, (5-17) be satisfied, we 
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require further that $::(0) be limited in amplitude. The function 7(¢) can 

be represented as a linear differential operator H(p) operating on c(é) 

Given the transfer function of the power component as G,,(s), H(p) o 

einaplgeay/ Gale: pana 
Let x(t) represent the total input: 


a(t) = r(t) + nt) + E@) (5-75) 


If we assume that £(é) is uncorrelated to the random i 
inputs r(¢t d 
Eq. (5-75) leads to the following: eb haee 


z(t) = brni(t) + dee(7) (5-76) 


In determining ¢::(7), the statistics of the unknown coefficients are taken 
into consideration. ‘To illustrate what we mean, each segment of y(é) of 
Example 5-1 can be represented by y(¢) = a + ait. From the assump- 
tion that y(¢) is continuous at each switching and from the given statistical 
data of a1, ®,,(s) is calculated, as shown in Eq. (5-35). 

By definition of g(r), 


c(t) =f," g@r)xtt — 2) dr (5-77) 
Operating H(p) on Eq. (5-77), 
i = H(p)elt) = f," oH (pelt — 1) dr (5-78) 
The operator p in Eq. (5-78) denotes d/dt. Since 
d 
fat 2) = — Lat -7) 
iq. (5-78) can be written as 
iW) = f° 9(r)H(—pi)2t — 11) drs (5-79) 


where p: represents d/dr;. From Eq. (5-79) we have 


* x ¢t 
OP = fy fy 9@rs)oCrs) (pi) H (— pa)a(t — ru)a(t — 12) dra dre 
‘Taking the ensemble average of the above equation results in 
ce 
bu(0) = ff, 9(rv)g(r2)H(—ps)H(—p») beara — 173) dra dre 
7 7 TF 
= fo fy 99G)H(—ps)H (rr) bec(r1 — 72) dries (5-80) 


ty Lagrange’s undetermined multipliers are used, the problem of mini- 
Miving d(O) while limiting (0) is equivalent to minimizing $..(0) + 


————————<&<& Te 
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k2g:(0), where the unknown coefficient k? is determined later by the con- Equation (5-29) becomes 
dition that ¢::(0) is less than or equal to a given limit. From Eqs. (5-10) i T/2 
and (5-80), we see that the solution of Sec. 5-5 still holds except that, in -—T/2 Bo(v)o(u — v) dv = 1 (5-84) 
all the integral equations T/2 
os / agp SHOE — 9) = =e (5-85) 
$(7) = dnn(t1 — T2) + k?H (— pi) H (pi) dzx(T1 — 72) (5-81) Gov) = ArGor(v) + ArBo2(v) (5-86) 
: : From Eq. (5-84), it is readily seen that i : . : 
is used as the kernel instead of brn: tunes Geka ae. Gs ae goi(v) is an even function while go2(v) is an odd 
Example 6-3. The problem of Example 5-2 is modified by an additional constraint T/2 
on the rms input to the power component which has as its transfer function K [s A / 7 go(v) dv =1 (5-87) 
Determine the optimum form of g(r). r/2 if 
Soluti = f 
olution r re * r/2 Box(v)(—v) dv => +a (5-88) 
® = Priv =. Dex * 
(s) 4 ee on “ 5 The eae of Eqs. (5-84) and (5-85) is given by Eq. (5-66). Since n =m = 1 
2b kes? 2 2 here are no 6 functions at the two ends; ho Ea. (5- . pe 
Finite LR leo + —s(p? — =| Iq. (5-84) we have meee, Sa Fe hae to beeneynom: ae 
2bBk? K2/k? — s? k2A2 


ReneS 7B 1 

KK? "6? — 8? K2(0? — 8?) jae at C, cosh au 
For the sake of simplicity, let us assume that the limit on the rms input to the power 
component is set just right for K/k to be exactly equal to 6. The above equation 
becomes 


he constant C; is determined by the boundary condition at u = 7'/2 only, since at 
u = —T/2 the boundary condition is automatically satisfied because of symmetry; 


A vif T 
2B a? — s? Cia sinh + +0 aT ov 
&(s) = no Coke a 5 (5-82) 1 5) + Cw cosh 5 s 0 
es Let 0 = aT /2; C, can be written as 
where a? = »? + A?/2bB. The constant 2B/b can be disregarded since it does not 


: ‘ a /a® 
influence the solution. We have D(p) = p? — v2, and N(p) = p? — a: Ce ee 5-89 
We can proceed exactly as we did in Sec. 5-6. However, when N(s) is dependent on Wquation (5-66) gives a sinh @ + » cosh 6 ( ) 
s, it is usually desirable to make use of the symmetry properties of the problem. We 
shall make the following modifications. AD Ze 
Zulu) == + Cia? — v? 
1. We shall use the new variables u, v defined as wu = 71 — T/2, 0 = 72 — T /2, one) a? > Cte v*) cosh au (5-90) 
The limits of the integrals become + T/2. Let g(u) be defined as Mimilarly, Boo(u) = — ? i lath sac AN ely te ‘eet> 
a? ne u 1 
g(u) = g(r) (5-83) C= 2+ rT 
2a2(a cosh @ + » sinh 8) (5-92) 
2. Equation (5-31) is not changed by replacing fi(¢) with a new set f;(¢), where Vrom Eqs. (5-87) and (5-88) we obtai 
3 ss ain 
fi) = 2. aiifi ©) NA, ANE So eek oA AP 
j v7 + 2aC,(a? — v?) sinh 6 (5-93) 
as long as the determinant ai; ~ 0. —— 6a?(T' + 2a) 
Instead of using 1 and ¢ as the two known functions, we shall have vT* + 24C.(a? — v?)(@ cosh @ — sinh 6) (5-04) 
. A . 
nw) <1 4 a numerical example, let a7 = 2, and »T = 14. Then 
Hm eS Boi(u) = 0.0625 + 0.150 cosh aw 
F r 2 Zo2(u) = —-0.0625u — 0.3197 sinh au 
Consequently, f(T —n) = T-3gons= —4 om 4.19  )) _ 13.03 


i aie a, - 


r | € € a 
AT +a -3te Bolu) = (0.202 — 0.815 77, + 0.628 cosh au — 4.15 sinh au) (5-95) 
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Making the substitution u = 7 — T /2, Eq. (5-95) becomes 


1 Le ar aT 5-96 
g(t) = F (0.670 — 0.815 7 — 0.648¢°" + 6.50¢ ) (5-96) 


Equation (5-96) gives the optimum impulse-response function for the over-all system 
and is plotted in Fig. 5-7. Comparing Figs. 5-6a and 5-7, we note that one effect of 
the power limitation of the fixed component is to round off the impulse-response 
function. 


Fria. 5-7. Impulse-response function g(r) of an optimum system with constraint on the 
rms input to the power component. 


5-8. Solution of Nonstationary Problems by Eigenfunctions. Let us 
consider the general nonstationary problem of Secs. 5-3 and 5-4. For 
any given t, go(t,t1) is to be determined as a function of t; by solving the 
integral equations (5-15) and (5-22). Both equations are of the form 


te a(t’) K(t,t’) dt! = f@® (5-97) 


where f(t) and K(t,t’) are known functions, and Kit’) = Kb). Equas 
tion (5-97) can be solved if the set of eigenfunctions associated with 
K(t,t’) is known. The eigenfunctions are defined by considering the 
following equation: 


B (UK (tyt") dt! = a(t) (5-98) 


where HZ is a constant. Equation (5-98) is known as a homogeneous 
Fredholm equation of the second kind. There is always the trivi 
solution x(t) = 0. However, for a discrete set of values of E, there 
also nontrivial solutions. These values of F are called eigenvalues a 
* will be denoted by H;, BH: ...,Hn,..- Associated with each cige 
value F,, there is a nontrivial solution of Eq. (5-98); it 1s called its eige 
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function. Obviously any solution multiplied by a constant is also a 

solution. A normalized eigenfunction u,(t) is defined as one which 
satisfies 

E, is Un(t)K (t,t) dt! = un(t) (5-99) 

fe [un(t)|2 dt = 1 (5-100) 


The kernel K(i,t’) is called nondegenerate if, for each E,, there is one 


and only one w,(¢). When K(i,t’) is symmetrical in ¢ and ¢’, the set of 
eigenfunctions are orthogonal to each other: 


hs Un(t)Um(t) dt =0 ifnXm (5-101) 


Equation (5-101) is easy to prove. Consider the following integral: 


ae the He ” tun(t) Um (EK (t,t”) dt dt! 
If it is integrated first with respect to t, the result is 


pa Sry ong eye 
Oy Ne Pitas cs 


If it is integrated first with respect to 2’, the result is 


eae 
J= al Un(t)Um(t) dt 


‘Therefore Eq. (5-101) must hold. 


The set of orthonormal eigenfunctions w,(t) is called complete if, for any 
arbitrary function f(t) defined in the interval a < t < b, a set of coef- 
ficients a, can be found such that 


FO = Y anun(t) (5-102) 


Now we return to Eq. (5-97). If we assume that K(t,t’) gives a com- 
plete, nondegenerate set of orthonormal eigenfunctions u,(t), x(t) can 
be expressed as 


a(t) = ) batin(t) (5-103) 


Substituting Eqs. (5-102) and (5-103) in Eq. (5-97) and making use of 


liq. (5-99) result in 
yt m0 = Dy aanatd (5-104) 


n 


Multiplying Eq. (5-104) by u(t) and integrating, we obtain, for all m, 
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bm = OmEm. Therefore Eq. (5-103) becomes 
a(t) = ¥ anE atn(d) (5-105) 


As dm, En, and un(t) are known, Eq. (5-105) is the poesia of a 

Ras she set of integral equations (5-15) and (5-22), t and : ra ips 
limits of integration, and ¢; and tz are the independent Naan ie 
ble of integration. The kernel rr, (t1,t2) and therefore the ange a 
depend on the parameter ¢. Let the set of eigenfunctions “ prraein 
Un(ttr). Let fo(ts) denote dr,v(ts,t), and coefficients a;;(t) be 


cc) 


} -106 
Sits) = » ai s(t) uj(t,t1) aac 0, AIF 2, Sy EN ie: | N (5 1 ) 


Owing to the orthonormal relations (5-100) and (5-101), 
a(t) = : * fe(tsyus(tyta) dts (5-107) 


Equations (5-15), (5-22), and (5-105) give 
goi(tyts) = ¥, asOBiOQus(th) (5-108) 
7 


ll 


. = -109 
go », de(t)goi(t,t2) ho = 1 (5-109) 


The coefficients \,() are to be determined. 
From Eqs. (5-18), (5-106), and (5-108), 


, y di(t)aij(t)axy(t) Ex(t) = > axj(t) hav uj(t,ta) h(t,t1) dty 
i=0j=1 43 k=-1,2,.-.,N (oe 


ear equations in \,(t) from which the 


; 3 nts N lin 
Equation (5-110) represe N, can be solved. 


ns \,(f),4 = 1,2, .--> : 
beers hee formal solution of the general nonstationary 


R sx oa 
roblem. Its applicability is strictly limited because = it a 
eattiins un(t,t:) is usually difficult to Reven kg he tical 
imi umber of kernels are these function a 
ee eos of the problem warrants, one may determine ie os 
functions by the Miller and Zadeh method. bis reader is re 
i thod. 

-» original paper for a treatment of this me ’ toll 
percent In this chapter the least-square aynthesie teeta 
using time-domain variables is formulated in _— aur e fe ik 

i ion time may be limited, there 
nonstationary, the observation m 
ee sausieis on the input to a fixed component, and, in addition 
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the least-square-error requirement for random inputs, the system may 
have to predict without error certain inputs which can be described as a 
linear combination of known functions with unknown coefficients. For 
a system with stationary random inputs and possibly limited observation 
time, a complete solution is given. For nonstationary systems, a solu- 
tion is given in terms of the eigenfunctions which are obtained with the 
correlation function of the random inputs as the kernel. However, this 
solution is only formal, since the eigenfunctions are generally not known. 

A correlation function ¢10(é1,t2) is defined in terms of the ensemble 
average, and its reduction to the stationary case is shown. Integral 
equations which give the impulse-response function g(¢,t:) of the optimum 
system in terms of the correlation functions of the random inputs and 
the above-mentioned known functions are derived [Eqs. (5-15), (5-20) 
to (5-22)]. 

A running prediction is possible with the time-invariant optimum sys- 
tem function, if the random inputs are stationary and the set of known 
functions satisfies certain conservation conditions [Eq. (5-23)]. A com- 
plete solution including possible 6 functions at both ends is given in terms 
of differential operators [Eq. (5-66)]. The presence of a fixed component 
together with its power constraint can be taken into consideration by 
modifying the kernel of the integral equations (5-81). Its practical effect 
is to reduce the order of discontinuity of the impulse-response function 
(Examples 5-2 and 5-8). 

Kigenfunctions and eigenvalues of a homogeneous Fredholm equation 
of the second kind are defined, and their properties, including complete- 
ness and orthogonality, are briefly discussed. A solution of the integral 
equations for the general, nonstationary case is given in terms of the 
eigenfunctions (Sec. 5-8). 

Besides the applications discussed in the main body of this chapter, 
(he ideas and mathematical methods developed here can also be applied 
to a large class of problems in statistical communication. Among these 
ure optimum detection of signal in noise and optimum signal waveform 
for transmission through a noisy channel. Readers who wish to pursue 
(his subject further will find useful the reference-material section at the 
ond of the book. 


CHAPTER 6 


OPTIMUM DESIGN OF SAMPLED-DATA SYSTEMS 


In the present chapter we shall study a class of control systems that 
receive intermittently monitored signals at regular intervals to control a 
continuous plant. The subject matter being treated is limited to opti- 
mum-design techniques of such systems, and the reader is assumed to 
have some familiarity with the usual nomenclature for sampled-data 
systems and the z-transform method. 

As is generally known, the errors between sampling instants are far 
more tedious to calculate than the error at sampling instants. For- 
tunately, no such difficulty exists when an integrated criterion is used. 
In the following sections, we shall discuss the various techniques that 
enable one to design an optimum system, in a prescribed sense, which 
gives equal consideration to error at all times, between as well as at the 
sampling instants. 

6-1. Criteria for Optimum Design. The design theory for sampled- 
data systems is almost an exact parallel to that for continuous systems, 
The criteria depend on the nature of the inputs: (1) optimization of tran= 
sient response to deterministic inputs and (2) minimizing of mean- 
square error in response to random inputs. 

In the case of mixed inputs, e.g., step input mixe 
the design procedure is the same as that for random inputs. 

As discussed in Sec. 3-2, with random inputs, the distribution function 
of the system error has a tendency toward approximating the normal dis 
tribution, and the mean-square value of the error is most significant in 
various applications. However, such is not the case if transient response 
to deterministic input is of primary interest. Two different criteria of 
optimization will then be discussed: (1) zero transient error in the short« 
est time and (2) least-integral-square error. 

Control systems designed according to criteri 
as deadbeat systems. They have the shortest settling time. 
(2) is an obvios extension of the least-mean-square-error cri 
random inputs. 

As mentioned previously, there is a close parallel between the desig 
theories of continuous systems and sampled-data systems. The simila 


ties are better exhibited by pointing out a few differences at the 
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d with random noise, 


on (1) are generally known 
Criterion 
terion with 
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Li 

: ee iovieeschire: system, the term “deterministic inputs” has 

g ricter implication than for a contin i i 

obvious exception of step i i ae seni 

p inputs, the starting tim t b 1 

tion to the sampling instant. O mace teeta at —— 

: therwise, the input 
sented as a random combinati eae ie Hy Ne mee 
ion of deterministic i 
treated as a single deterministic i one ce a 
nistic input. As it is in i 

external source can be s i os dae eee 
ynchronized to the sampli f 

the above consideration poi Mot iiss tage 
points to the necessity for certai i ict 

when unsynchronized deterministic i A a he 

istic inputs are involved i 

— = be discussed along with the main theory 1 a 

Poe Beate systems without constraint, the system with the 

‘ settling time as well as the system with the least-square error has 


K (z) 
3 oe Gz) 
e3 (t) 
an) Y 9s | 


Processing units 


I Ia. 6-1. Block diagr am show ing definitions of tr ansfer functions and sy stem variables 
ina sampled-data system. 


oa its closed-loop system function. For sampled systems, the solution 
8 by no means trivial. We may reason the di 
e difference as foll iy 
sampling process itself introduces an ae 
: process average delay of half i 
period, which is, in effect, a functi i ae 
; ; ; onal constraint placed 
Ass 1 18, placed upon the system. 
Brinson a the asreens System Function K(z). Before pro- 
ing t m-design techniques, we would d ll i 
restrictions on K(z), which i "| 7R i Sone 
sar (z), which is the equivalent of C/R in a continuous 
mild. nt = illustrates a typical sampled-data system. The transfer 
bea jons G(s) and G(s) of the controlled plant and hold circuit ear 
3 LP Basia The data-processing unit D(z) which cecnien on 
oe i © error signal er (t) is to be designed. Sometimes it is desira- 
ode “A roduce an auxiliary processing unit D(z) which operates on the 
pled output variable e*(¢), as shown by the broken line. D(z) and 
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D.(z) take the places of series and shunt compensations, respectively, 
éf a continuous system. In practice, D(z) and D2(z) can be built into a 
single digital processing unit. The input functions are either r*(t) and 
c*(t) or eX (t) and c*(0), and the output is eX(é). Especially in the former 
case, we may analyze the system as one with z-dependent main-loop 


e(t) 


0 2T 4T 6T 8T 10T t 


Fra. 6-2. Response of hold systems. Circles, sampled input to hold systems; solid 
lines, output of zeroth-order hold; broken lines, output of first-order hold. 


feedback rather than one with an inner loop. The equivalent feedback 
function then is 1 + D,2(z)/ D(z). 
The sampled system function K(z) is defined as 


Ee). ia (6-1) 


In terms of the control elements, 


= D(z)Gilz) (6-2) 
K@) = TF D.@Gie) + DOE) 
where G;(z) is the z transform of Gi(s) = G,(s)G,(s). In most applica- 
tions, @,(s) is a rational function of s multiplied possibly by a factor 
e-#* representing transportation lag; G@,(s) usually represents a zeroth= 
order hold system: 


PA ARR (6-3) 


s 
Sometimes a first-order hold system is used; then 
Bish Ol eee 


a) - =o 4 Go (6-4) 


The input-output relationship of a hold system is illustrated in Fig. 6-2, 
A zeroth-order hold system holds the sampled value for a single sampli 
period 7. A first-order hold system gives the extrapolation of the tw 
preceding sampled values. It cannot give an interpolation of the twi 
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nearest sampled values since no physical system is capable of utilizing 
future data. 

With G(s) and G,(s) of the above description, G,(z) can be written as 
V(2) 
U(z) 


where U(z) and V(z) are two polynomials of z2~! written in the order of 
rising powers of z~'. Without losing generality, the first term of U(z) is 
assumed to be 1. The first term of V(z) is, in general, z_™ because of 
possible transportation lag, and m is the number of sampling periods that 
have elasped before observing the first nonzero sampled output. Some- 
times there are roots of V(z) outside or on the unit circle. Let these roots 


Gi(z) = (6-5) 


be denoted by \x, 7 = 1,2, .... V(z) can be expressed as 
V(z) = Vile) V2(z) (6-6) 
l 
where a. Viz) = 2-™ [J (l — Ae) (6-7) 
i=1 


and V,(z) is a polynomial in rising powers of z~ with all its roots inside the 
unit circle. The first term of V;(z) is a constant term, by definition of m. 

In the literature on sampled-data systems, rational functions of z are 
usually written as a ratio of two polynomials in z~!. However, when one 
refers to “roots,” “poles,” and “zeros,” the “roots” of z and “poles” 
and “zeros” on the z plane are usually meant. The apparent confusion is 
resolved once we know the above-mentioned rules. On the other hand, 
these rules are rather convenient for recognizing possible transportation 
lag ina system. If a certain sampled transfer function implies negative 
lag or anticipation, it obviously does not represent a physical element. 

Similarly to the continuous case, if a sampled network function F(z) 
is to represent a stable physical system, it must meet either of the follow- 
ing conditions, which are equivalent to each other: 

|. F(z) is expressible as a series in z~!, without the presence of any 
positive powers of z, and the coefficient of z~" approaches zero as n 
approaches infinity. 

2. F(z) is expressible as A(z)/B(z), where A(z) and B(z) are poly- 
nomials in gz. The first term of B(z) is 1, and all the roots of B(z) are 
inside the unit circle on the z plane. 

We note that condition 1 is similar to the requirement on the impulse- 
response function of a physically realizable continuous system, and 
condition 2 is similar to requiring that all the poles of F(s) be inside 
the LHP. These conditions are referred to as conditions of “physical 
realizability.” 

l’or a system to be closed-loop stable, K(z) must meet these conditions. 
Hlowever, they are not sufficient. The factor V.(z) represents the trans- 
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portation lag and nonminimum-phase effect of the fixed component which 
includes both the plant and the hold system. If V.(z) is not to appear 
as a factor in K(z), the manipulated signal e¥(t) would have either to 
anticipate r*(t) or to rise exponentially following any initial disturbance. 
The proof is similar to that for the continuous system as given in Sec. 
4-1. The restrictions on K(z) can be summarized into the following 


equation: 
K(z) = V2(z)F (2) (6-8) 


where F(z) meets the condition of physical realizability. 

Equation (6-8) is equivalent to a set of conditions first derived by 
Bergen and Ragazzini. They based their deductions on the impossibility 
of canceling exactly a zero in G,(z) by a pole in D(z). While a slight dis- 
location is not overly harmful when the unintended dipole pair is inside 
the unit circle, it causes instability if the dipole pair is outside. Using the 
same argument, with reversed roles of pole and zero, Bertram derived a 
condition on K(z) for a system with an unstable plant: For a series- 
compensated system with unity feedback, any pole of G,(z) outside 
the unit circle must appear as a zero of 1 — K(z). As this restriction 
can be removed by either z-dependent feedback or shunt compensation 
[D.(z) ¥ 0], it will not be considered in the following sections. On the 
other hand, our derivation of Bergen and Ragazzini’s condition is inde- 
pendent of system configuration. Equation (6-8) must be observed no 
matter how we propose to compensate a system. 

6-3. Design of a Deadbeat System.} If the reference input r(t) can 
be represented as an nth-order polynomial in ¢, then 


Cee eas as (6-9) 


where Q(z) is a polynomial of 2’. If K(z) satisfies 
1— K(2) = (1 — 27)" We) (6-10) 


where W(z) is a polynomial in z~', the sampled error ef vanishes after @ 
finite number of samples. This is easy to prove. Since 


E,(@) = R@ — C@ =[1 — K@]IR@ = P@WE) (6-11) 


the number of nonvanishing sampled errors is equal to the order of the 
polynomial Q(z) W(z). 


+ Parts of Sees. 6-3 to 6-9, inclusive, are reprinted from the following two papers by 
the author: Statistical Design Theory for Strictly Digital Sampled-data System 
Trans. ALEE, pt. 1, vol. 76, pp. 702-709, 1957, and Statistical Design Theory 
Digital Controlled Continuous Systems, Trans. AILEE, pt. 11, vol. 77, pp. 191-201 
1058, 
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We note that K (2) must satisfy Eq. (6-8) in addition to (6-10). W(z) 
being a polynomial implies that, F(z) being also a polynomial, Eq. (6-10) 
can be satisfied by selecting F(z) in such a way that 


Ra) ='1 
K®(11)=0 i=1,2,...,” (6-12) 


There are n + 1 simultaneous equations in (6-12). In order to satisfy 
these equations, the polynomial F(z) has a minimum of n + 1 terms. 
Therefore F(z) is of at least nth order. We note that a lower-order F(z) 
implies a lower-order W(z) and consequently fewer terms of sampled 
error. The system with an nth order polynomial F(z) in 27? is called a 
Pi design. It reduces the sampled error to zero within minimum 
hime. 

However, reducing the error e* to zero is not the same as reducing e to 
zero. For the latter, the requirements are stricter, as expected. Since 
r(t) is a polynomial in ¢, and c(t) = r(é) for all t > t, the input to the 
plant e2(t) must be a polynomial in ¢ for all ¢ > ti. It follows that e2(¢) 
cannot be discontinuous. The order of e¥(#) must not exceed the order 
of the hold system. For instance, with a zeroth-order hold, if ef (t) is a 
constant, é2(t) is the same constant. Otherwise e2(t) is a series of steps. 
With a first-order hold e¥(¢) is allowed to be a linear function of time, etc 
I).(z) can be expressed as igh 

_ K(z@)R(z) _ K(z)U(z) RE 
Ee) = “FQ = Kenn (6-13) 


From the above discussion, we see that the following conditions must 
be satisfied : 

1. With a kth-order hold system, H2(z) is allowed a (k + 1)st-order pole 
at z= 1. Since K(z) cannot have a zero at z = 1, Eq. (6-13) implies 
that U(z) has a zero at z = 1 of at least (n — k)th order. In other words 
(/,(s) must have a (n — k)th-order pole at s = 0. 

2. Since e2(t) is a polynomial in t, Z2(z) cannot have any pole other than 
1, But none of the roots of the polynomial V(z) is at 1. Thus K(z) 
must be divisible by V(z). Instead of Eq. (6-8), we have 


K(z) = V(2)F(2) (6-14) 


3. Equation (6-10) is required since the sampled error ef (t) must be 
yoro after a finite number of sampling periods. 

Condition 1 is a condition on the controlled plant. If it is not satisfied 
n deadbeat design is simply not possible. If it is satisfied, we can “ae 
liq. (6-12) to determine F(z) as before. The system with an nth-order 
polynomial F(z) (in z~!) is called a prototype design. It reduces the 
error e(t) to zero within minimum time. 


eee 
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Equations (6-10) and (6-14) and condition | are the necessary conditions. 
In order to be sufficient, another condition on the plant has to be added to 
the group. That is the following: 

4. None of the roots of G,(s) are at a distance j(2r/T) from each other 
on the s plane. 

To prove this, let us consider the state of affairs after a time interval 
ti, which is sufficient for both of the following to have occurred: 

1. Asa result of Eq. (6-10), the error ex(t) = Ofort > fi. 

9. Asa result of conditions 1 and 2, e.(t) can be expressed as a poly- 
nomial in ¢ for t > 1. 

Let ! =t—t. The output function c(t’) consists of a polynomial in 
t together with a number of transient terms e*’, where the a;’s are the non- 
zero poles of Gi(s). The input r(t’) is a polynomial in ¢’. Since 


e(t’) = r(t’) — e(#’) 


e:(t’) can be written as 


ex(t’) = ), Beli + ¥ Ae (6-15) 
j=0 i 
The z transform of e:(¢’) is 
C; Ai 
Bi) = Fs Germ t >, Tete Cit 
j=0 i 


Owing to condition 4, none of the poles e*” are identical. As e¥(é) = 90, 
E,(z) = 0. Consequently, Cj = B; = A; = 0 for all z and j. Equation 
(6-15) gives ex(t’) = 0 for t' > 0. We note that condition 4 is necessary 
in the proof. Ife"? = e””, we have the possibility that Ai = — Az #0. 


Physically, it represents a transient term which vanishes at every Same” 


pling instant. 

There are two remaining points to be discussed: 

1. The first is the matter of unsynchronized deterministic input, 
Fortunately, deadbeat systems do not require synchronized inputs; 
Only the order of the polynomial r(t) enters into the design equation, 
Since an nth-order polynomial in ¢ — f: is a different nth-order poly- 
nomial in ¢, the same system results no matter which polynomial is used, 
However, the coefficients of the polynomial do have some effect on the 
required number of sampling periods for reducing system error to zero, 
The order of the polynomial Q(z) of Eq. (6-9) depends on the coefficients 
of r(t). The lower the order of Q(z), the faster the system arrives at the 
errorless state. 

2, While a prototype design requires the least time to reduce sys 
error to zero, it is usually not the best design. For instance, plant satu 
tion is a consideration, and a prototype design tends to force the sys 
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by sending large inputs to the plant. This is especially true in a system 


with high sampling rates. One wa i 
; y to modify a prototype design is 
use the following sampled system function. ot Shore 
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N 
K(z) = Ko(z) + V(z) ¥ Ad — 2-4 (6-17) 
i=1 

We note that, with arbitrary constants A;, Eq. (6-17) satisfies both Eqs 
(6-10) and (6-14). The settling time of the system is increased b WV 
sampling periods. However, by properly choosing the N piri A; 
the coefficients of H2(z) (as a polynomial or series in z~!) can be ver Be: 
stantially reduced. Alternatively, the A,’s may be determined of eu 


other criteria, such as least-s 
-square error, least-square error with limi 
mean-square input to the plant, ete. io 


Ita. 6-3. System error and mani i 
Ee ae ee anipulated variable of a deadbeat system subsequent to a 


Example 6-1. For the system of Fig. 6-1, let T be the sampling period, and 
10/T 
G,(s) = —— a 
o(8) = q+ Pep 
R@) =+ 
8 


D(s) is to be designed to give zero error in the least time 
Solution. l 


The transfer function for the plant and hold system is 


ou. ka 
8 Ts(1 + 7's)? 


Correspondingly, G(z) = eG + 3.8ta DO 50.108) 


(1 — a71)(1 — 0.36827") 
V(z) = e7*(1 + 2.34e7')(1 + 0.1627") 
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Since n = 1, we write 
K(z) = (co + 12) V@) 


The coefficients co and c: are determined by Eq. (6-12) with the following results: 


co = 0.732 
* ¢, = —0.473 
Ko(2) = 0.7322-1(1 + 2.34274)(1 + 0.162") — 0.6472) 
bw = 0.732(1 — 0.647271)(1 — 0.368271)? 
G — 2) + 1.10824) (1 + 0.1627) 


The system error e(t) and manipulated signal e2(¢) are calculated and plotted in Fig. 


6-3. 


6-4. Systems with Least-integral-square Error. The integral-square 
error following an input r(¢) is 


“ ee 
Pe CO = [o Bono a 8) 


where E(s) is the Laplace transform of the transient error e(¢). With 


reference to Fig. 6-1, H(s) can be expressed as 


E(s) = R(s) — C(s) = R(s) — R*(s)K*(s) ae (6-19) 


To evaluate J, the path of integration is divided into segments of length 
ws, a8 shown in Fig. 6-4. Let E(s) denote E(—s). Equation (6-18) can 


s plane <a 


Fra. 6-4. Diagram showing division of jw axis into length w.(w, = 2r/T). (From 8, 
L. Chang, Statistical Design Theory of Digital Controlled Continuous Systems, 7 
AIEE, pt. Ul, vol. 77, pp. 11-201, 1958.) 
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be written as 


is 5g ye £(s)E(s) ds 


1 oo 
= aS Fx ae E(s + jnws)E(s + jnw,) ds 

°T [jut 
= (EE)*(s) ds (6-20) 


a Qnj — (jws/2) 
The last equality sign is obtained by interchanging the order of summa- 
tion and integration. It is justified on the physical basis that |H(jw)|? 
must converge rapidly to zero at least as fast as 1/w? for large w, and 

? 


therefore the series x E(s + jnw,)E(s + jnw,) converges uniformly. 


n=—2 


In terms of the variable z, Eq. (6-20) becomes 
Leama 
Pi (6-21) 
The contour of integration is the unit circle. From E i 
can be evaluated as follows: +O ae 


E(s)E(—s) 


= | 2 — R*K*(s) G(s) 


GT(s) 


| RA —8) — R—-OKX(—9) G3 | 


™ R*(s) R*(— * *( 
= RE) R(—3) ey Gls)G(—8) 
_ R¥(s)K*(s) R*(—s)K*(— 
SAG BOR -# — OF  Al-OR) 


Applying the z transform to the above equation and noting that 2-1 = e-*7 
result in 


BE(2) = RR(e) + ee on G:G4(2) 
hn R(@)K(2) = af R(z-) K(z7}) - 
OR! GR ) — BEE GR) (6-22) 


Similarly, the integral-square value of an arbitrary system variable 
a(t) in the controlled plant can be expressed as 


Ja = i (a(t)]? dt = oF AA() de <M (6-23) 


where _A(s) = E¥(s)G3(s) - 
AA(2) = F(z) Ex(e)GsG5(z) ine 
= R(z) R(z-")K (2) K (27!) 


Gy (2)Gi(2~") 


GsG's(z) (6-25) 
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The transfer function G;(s) of Eq. (6-24) relates a(¢) to the manipulated 
variable e(t). 

As shown before, minimizing J. while holding the value of Ja below a 
given limit M is the same as requiring that 


Je+ kJ, = min (6-26) 


The functional constraint on K(z) is given by Eq. (6-8) in conjunction 
with Eq. (6-7). To facilitate further manipulations, we shall write V 2(z) 
1 


as (—1)'z-”’P(z) [] :, where m’ = m + land 
i=1 
l 


P(z) = I] (1 ses £) (6-27) 


i=1 
The functions G,(z) and K(z) can be written as 


Gilz) = 2" P)GulZ) (6-28) 
K(z) = 2” P(2)F(z) (6-29) 


where F(z) is an unknown function to be determined and G1;(z) represents 
the remaining factor of the transfer function G,(z). In general, both 
Gii(z) and F(z) are ratios of polynomials in z~ with nonvanishing con- 
stant terms. It is obvious that, if the constant term of any of the poly- 
nomials vanishes, m7' would not be the correct value of transportation lag 
in G,(z) or in K(z). Another condition is that all the poles of F(z) be 
confined to the interior of the unit circle, but not necessarily the zeros, 
while all the zeros of Gi(z) must be confined to the interior of the unit 
circle, but not necessarily the poles. 

The derivation of the optimum form of F(z) is similar to the procedure 
used in Sec. 2-4. Instead of s and —s, we now have z and 27. Instead 
of the imaginary axis, LHP, and RHP, we now have the unit circle, inside, 
and outside the unit circle, respectively. Let F(z) represent the dif- 
ference between an arbitrary admissible function F(z) (satisfying the 
conditions of physical realizability of Sec. 6-2) and the optimum function 
F,(z). The same steps from Eqs. (2-11) to (2-15) give 


af | ROREOPOPE” or rai F(z) 


— PYRE) Oe [re dz =0 (6-80) 


for arbitrary admissible F',(z~), as the necessary and sufficient condith 
for F(z) to satisfy Eq. (6-26). Since all the poles of admissible I'\(@) 
are inside the unit cirele, all the poles of /’\(e~') are outside the unit et 
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A necessary as well as sufficient condition for Eq. (6-30) is that the expres- 
sion in the brackets not have any pole inside the unit circle. Let 


py — GGi(2) + °GG3(z) 

¥(@)¥(-) = 2™ 3 

(z) (z ) G1(2)Gi(z-) (6-31) 
Z(2)Z(z) = RE) Re) (6-32) 

Y(z) and Z(z) are further characterized by the condition that all their 


poles and zeros be inside the unit circle. To illustrat is i 
: : e how this is done: 
The right-hand side of Eq. (6-31) is factored into ih ar 


[I] G - az) (1 — az) 
fe ee ee ee es 
[I] @ - Be) — Bz) 


J 


where |a;| < 1 and |6;| < 1 for all i and j. 
The function Y(z) is given as 


A Il (Qa! — a,z—1) 
igre Sania ef 
Ta - a se 


The reason that the factors are written as above is to avoid introducing 
extra poles on the wrong side of the unit circle. For instance, z — a has 
i zero at a and a pole at ~, while 1 — az—! has a zero at ayer a pole 
at the origin. The pole and zero of 1 — az—! are located inside the unit 
circle. It follows that all the poles and zeros of Y(z) are inside the unit 
circle. Whenever a root is larger than unity in magnitude, we write the 
factor as 1 — az, where a is the reciprocal of the root. The pole and zero 
of 1 — az are outside the unit circle. 

The function Z(z) is obtained by factoring R(z)R(z-!) in a similar 
manner. We note that, no matter how the poles and zeros of R(z) are 
located, all the poles and zeros of Z(z) are inside the unit circle. How- 
ever, Z(z) usually has poles on the unit circle. What do we do with these 
poles? Just as in the continuous case, we consider these poles as 
ipproaching the unit circle from inside in a physical sense. For instance 
lor a step input, we may write 


1 
se canal en (6-34) 


rhe small constant « may be the reciprocal of 1 million years. For all 
practical purposes, (z) represents a step input. When the location of a 
pole is significant mathematically, e-* is obviously a pole inside the unit 
virele, For all numerical computations, however, e~ is the same as 


———— xxx ww <<< hh 
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unity. We may write, instead of Eq. (6-34), 
1 
= ——— 6-35 
R@) = po (6-35) 


with the understanding that there is a factor e~*? ahead of 27}. 
Another reason for introducing Z(z) is to dispose of any factor 2 or 
z-!in R(z). For instance, for a ramp input, 


With the functions Y(z) and Z(z) as defined above, the condition for 
optimum F(z) is 


oF (2) 2 (2) (2) ¥ (e-)P@)Pe)Fole) — 2" P(e) RE") a 
= X(z) (6-37) 
Dividing Eq. (6-37) by Le) Y(z-)P(2), 
Ey oyera z”’-1P (2-1) R(z“)GiR(2) 
mPENZOY ORE) — PHY MMAMAE 
ae) (6-38) 


~ POOLEY) 


We note that all the poles of z-'P(z~!)Z (z) Y(z)Fo(z) are inside the unit 
circle and all the poles of X(z) /P(2)Z(z) Y (2) are outside the unit 
circle. Writing the remaining term into a partial-fraction expansion and 
collecting the terms in the expansion according to their pole locations, 


we have 


2.05 


2'—1P (2-1) R(z)Gi R(2) . A; B; . 
é tet peat ro (6-39) 


PRY ENZE)Ge") é, 
where |a;| < 1 and |b;| > 1 and W(z) is a polynomial in z. 
Q;(z) and Qo(z) are defined as follows: 


Qi(z) = Hs 2 An 


a 


a =) 2 + WO 


a 


Equation (6-38) can be written as 


PP) Le) Y@Fe) — Ol) = pazenyen + OA 


The expression on the left-hand side does not have any poles out 
the unit circle, while the expression on the right-hand side does not hi 
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any poles inside the unit circle. Therefore both expressions must be equal 
to a constant é, and 


P(z)Z(z) Y(z)Fo(z) = 2Qi(z) + ke 
A; 
= r perenne ater, (6-43) 
By definition, neither P(z!), Z(z), Y(z), nor F(z) can have a pole at 
infinity. Consequently & = 0, and Eq. (6-43) becomes 


Eole) = (6-44) 


1 2 A; 
Z(@)P(e) Vz) Ly 1 — az 


We note that the summation on the right-hand side of Eq. (6-44) can be 
obtained by “partial fractioning” the function 


z™ P(z-!) R(z“)G, R(z) 


U(z) = POYV]EDZE GE) 


7 bet ue bi A;/(1 — az) directly and collecting terms inside 
‘he unit circle. e shall denote this operation as [U(z E j 
(6-29) and (6-44) give oT St in eee 


Kee) = Fare AY | POY EE 


Z(z)P(z) Y(z) | Plz) Y(2-)Z (2) Ga (ze) ih (6-45) 
Let Y.(z) be defined as 
2™ P(e) 


Yi(z2) = Pe) 


Y(z) (6-46) 


quation (6-45) can be written as 


o 1 R(z-)G,R(z) 
Ky 
@) = 7aVw® Yile-)Z(z-)Gi(e-) l. 


(6-47) 


There is a simple graphical method for determini 
ng Y(z) and Y,(z). 
On the right-hand side of Eq. (6-31), the variable z appears in pairs of 
factors 

(1 — az )(1 — az) = 14+ a@ —- a(z +2) (6-48) 


Let Q = AG +27). The right-hand side of Eq. (6-31) can be written 
as a function of Q. If necessary, the function 1/Y Y(@) can be obtained 
hy a root-locus plot on the @ plane. Y(z) is then obtained from Y Y(Q) 
i shown by Eq. (6-83). Equation (6-46) gives Yi(z) by switching ‘ 
Humber of poles of Y(z) to their reciprocal values and also attaching the 
factor 2” which represents transportation lag. 


ee 
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6-5. Statistical Properties of Sampled Signal. For continuous random 
signals, we have two distinctive though equivalent representations of their 
| statistical properties, namely, spectral density and correlation function. 

We still have sampled spectral density and sampled correlation function 

for sampled signals; however, the two representations tend to merge. 
This is rather characteristic of the z-transform method. For instance, 
the z transform of a sampled signal i(n7’) is 


c) 


I(z) = ¥) i(nT)e™ 
The values of the time function i(n7) are simply the coefficients of the 


power-series representation of I(z). 
The correlation function of sampled signals 7; and 72 is ¢1.(nT). Their 
sampled cross-spectral-density function can be defined in two ways: 


z transform of ®2(s) 


n=o 
> o12(nT)z 
n=—o 
We note that, in order to use the first definition, the two signals must 
be continuous to begin with, so that their cross-spectral density #12(s) 
is defined. To use the second definition, the two signals may be sampled. 
It is only necessary to know ¢12.(nT), which may be calculated as 


gi2(nT) = i(mT )iox(mT + nT) 


where the averaging may be taken over m or the ensemble. Thus the 
signals may be undefined except at sampling instants. 

In case the two signals are continuous and stationary, we shall show 
that the two definitions are equivalent. Readers who are not interested 
in the mathematical details may skip the remainder of this section. 

Let &(s) be a real, rational function of s, with roots in both the RHP 
and LHP, and converging at least as fast as s~? as $s —> ©. Three dif- 
ferently defined (z)’s will be denoted as &,(z), ®:(z), and &,(z), respet= 
tively. We shall show that they are identical with each other. 

Case 1. Single-ended Series, ®.(2) 


(6-49) 


#.(¢) = >, a(n) 


i where galt) = a [ _ &(s)e" ds (6-50) 


| 
| | and 1, is the contour shown in Fig, 6-5. It consists of a vertical line 
| 
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wate lees eee Cet ali ee 
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Region II 


"ia. 6-5. Diagram showing pol i i 

pole locations, contours of integration, and i 

~ ver (From 8S. 8S. L. Chang, Statistical Design Theory of Digital Ceabolles 
ontinuous Systems, Trans. AIEE, pt. Il, vol. 77, pp. 191-201, 1958.) ; 


to the right of all the poles of 4(s) and a large semicircle at infinit 

or iclosing all the region to the left of aa’. Correspondingly, the series of 

iq. (6-49) converges for values of z outside a circle on the . plane, with 

radius larger than unity. However, once expressed in closed ford, ®,(z) 

also exists for values of z inside the circle. The tables for o-tranall m™ 

functions are usually calculated on this basis. ; 
Case 2. Double-ended Series, &;(z) 


Bz) = YS go(nT)e (6-51) 
y at 1 
where ( —-, &(s)e* ds fort 20 
eli Qa7 Jr, = 
a st es 
Oa} 4, ee dasa) fort. = 0 


Case 3. Z Transform of &(s), ®,(z) 


ro) 


2 1 
P2(8) = » ®(s + jnw,) (6-53) 


n=-© 


- #.(2) is obtained by substituting z for e*? in &*(s), The series of 
4), (6-51) is convergent on the unit circle of the z plane or in the neighbor- 
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hood of the imaginary axis on the s plane (shown as region II in Fig. 6-5). 
Integrals involving &,(z) and @,(z) are evaluated along the unit circle. 


It will be shown that the three definitions of &(z) are identical. In 
other words, when ®.(2), ,(z), and ®,(z) are expressed in closed form, 
they are one and the same function. 

Tet &*(s), &*(s), and 67(s) represent the respective sampled Laplace 


ce 


transforms. Let 


dr (t) = 4 6(t — nT) (6-54) 


n= 


and b7r_(t) = ) 5(t + nT) (6-55) 


n= 


The Laplace transform of Eq. (6-54) gives 
Ars(s) = ) em? (6-56) 
n=0 


The series on the right-hand side of Eq. (6-56) is convergent for posi 
values of s. In the region of convergence, it can be expressed in close 


form as 


1 | 
Ar.(8) = [=r (6-57) 
Similarly, for negative values of s the Laplace transform of Eq. (6-55) 
gives 


) 


s3T 1 
Ar_(s) = ) of =-—— = -T ose (6-58) 
r-(s) 1-—e ? e 


n=1 


From Eqs. (6-49), (6-50), and (6-56), *(s) can be written as 


H*(s) = Lf dalt)ir+(O)} 
= = J Ars(s — 8’)®(s’) ds’ (6-69) 
™) JT. 


where s’ is the dummy variable of integration and the path of inten 
IT; is traversed by s’. Equation (6-59) is a convolution integral in the 


sdomain. For the value of s to the right of aa',s — s' is positive a 
the entire path of integration. Therefore writing Ar;(s — 8”) in close 


form, according to Eq. (6-57), is allowable. 


i Jnl —e 


1 (s’) ds’ ' ; 
#t(8) = On) I —— iF for s in region I 
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Similarly, from Eqs. (6-51), (6-52), (6-57), and (6-58), 
$3 (s) = Li do(t)dr+(1)} + Li do(t)dr_(} 

ee: ®(s’) ds’ 1 ®(s’) ds’ 

~ Qnj ip ie epee “4 Qnj [. l— eer (6-61) 
Equation (6-61) holds for s in region II, that is, the space between T» 
and T;. With s in region II, the conditions of convergence for both 
Eqs. (6-57) and (6-58) are satisfied. 

In evaluating the contour integrals, considering s as being in region I 

in Eq. (6-60) means simply disregarding the poles at (s — s’)T’ = j2an, 
where v is an arbitrary integer, and considering s as being in region II in 


Iq. (6-61) means the same. Since &(s) does not have any pole in region 
II, T, and 1; of Eq. (6-61) can be combined to give T;. Therefore, 


B5(s) = HF (s) (6-62) 


To prove that ®*(s) is also the same function, let us consider an integral 
with a large circle of infinite radius as the contour of integration I; 


nd &(s’) ds’ 
217 Des if _ e—(s—-s)T 


= G (6-63) 


The integral vanishes since &(s’) approaches zero at least as fast as 1/s?. 
The left-hand side of Eq. (6-63) can be considered as the sum of two inte- 
grals, J; and Je. In evaluating J1, the poles at (s — s’)T = j2mn are 
disregarded, and in evaluating J2, the poles of #(s’) are disregarded. 
Therefore, 

$3(8) = J1 = —J2 (6-64) 


Let w, = 2r/T. The residue at s’ =s+jnw, is —(s + jnw,)/T. 
Therefore 


J, = -a PA ®(s + jnw,) = —#7(s) (6-65) 
l’'rom Eqs. (6-62), (6-64), and (6-65), the following equality is established : 
B3(s) = OF (s) = BF (s) (6-66) 


‘This is the same as 
@.(z) = &(z) = ®(z) = &(z) (6-67) 


‘The above proof justifies the use of standard z-transform tables for the 


eileulation of &(z). While it is proved only for the case of ®(s) not hav- 
ing any pole on the jw axis, the result is nevertheless applicable to cases 
it which #(s) has a finite number of poles on the jw axis, in the limiting 
sense that these poles can be approximated to any degree of desired 
fecuracy by poles not exactly on the jw axis, as discussed in Sec. 6-4. 


/ = 


j 
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| 6-6. Input-Output Relationships of a System with Sampling. In 
Sec. 3-6, we have shown that in a continuous, linear system, if the spectral 
densities of the input signals are known, the spectral densities of the out- 
put signals can be readily calculated. We shall derive here similar rela- 
tions for sampled-data systems. There are necessarily more varieties, 
because of the possibilities of changing a continuous signal into a digital 

b3(t) 


b(t) ba (t) 


Correlation 
computer 


Fic. 6-6. Block diagram illustrating input and output relationship ofa sampled system 
(case 1). (From S. S. L. Chang, Statistical Design Theory for Strictly Digital Sampled- 
data Systems, Trans. AIEE, pt. 1, vol. 76, pp. 702-709, 1957.) 


signal by means of a sampler, or a digital signal into a continuous sign 
by applying it to a hold circuit in combination with a continuous ¢o 
trolled system. There are three basic situations: 

Case 1. Correlation of Digital Signals after Transmission through & 
Linear Network. Let signals a(t) and b() be the sum signals, as shown 
Fig. 6-6, in which H.:(z) and H;(z) represent the transfer functions 
} stable systems or prescribed linear operators, and a;(¢) and b,(t) 
stationary random sequences of time, t=... , —27, —T, 0, T, 21 

In terms of the z transform, the two signals can be expressed 
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A(2) = ¥ Haz) A) (6-68) 
Biz) = » Fy ;(z) B;(z) (6-69) 

J 


It will be shown that 


$3(z) = bs \; A ai(2~!) Ho ;(2) a3,(2) (6-70) 


where ®,,(z) is the sampled spectral density of the resultant signals a(t) 
and b(t), and ®,,(z) is that of the source signals a,(t) and b,(é). 

Let A,,(z) and B,,(z) represent the z transforms of the sampled signals 
a(t) and b(é), respectively, where ¢ = 0, T, 27, ..., (m—1)T, mT. 
Then 


a(iT)zi 2 b(jT) 2-4 


0 j=0 


k=m-i 
a(éT)b(iT + kT)z-* 


Ane Bae) ; 


v 


ll ll 
Ms ims 


2 
i) 
> 


m m—k ‘ 
mig + YY) aG7yoGT + kT)-* (6-71) 
+=0 


k=—m i=|h| k=0 


Dividing Eq. (6-71) by m and taking the ensemble average result in 


x (An(2)Ba()) = y ( = x) bar(kT)2—* (6-72) 


bu = 
Letting m approach infinity, Eq. (6-72) becomes 


cc) 


dim - (An(2-)Bn(z)) = ga(kT)z* = Paz) (6-73) 


k=—0 


lor the system of Fig. 6-6, let A/,(z) and B/,(z) represent the z transforms 
of the sum signals a/ (¢) and b/,(£) when the source signals are limited to m 
mimpled sets of values. Obviously, a(t) and b(t) persist somewhat 
longer, because of the storage properties of H,,(z) and H;,(z). However, 
(his end effect has only a limited number of significant digits and is 
hogligible if m is made large enough. From Eqs. (6-68) and (6-69) we 
obtain 


Aj (2) Bi (2) = YY) Hae) Hp,(2) Am(2-1) Bn,(2) (6-74) 


‘ 


| a ey 


ee 
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Dividing Eq. (6-74) by m, taking the ensemble average, and letting m 
approach infinity result in 
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Jim © (Ane) BRC) = ys 7 Ha(@)Hy(2)®o0,(2) (6-75) 


Since the end effects become negligible as m approaches infinity, we have 
tim 2 (AL (e)BY(2) = lim + (An()Bu@) = Pal) 6-76) 
mo m m— © 

Equation (6-70) follows from Eqs. (6-75) and (6-76). 


Case2. Correlation between a Continuous Signal and the Response of 
a Continuous System to Digital Signal. For the system of Fig. 6-7, if 


b(t) al 


i i ing i i ip of a sampled sys- 
_ 6-7. Block diagram illustrating input and output relationship of a 
pio (case 2). (Prom S. S. L. Chang, Statistical Design Theory of Digital Controlled 
Continuous Systems, Trans. AIEE, pt. XI, vol. 77, pp. 191-201, 1958.) 


the correlation function ¢sa(r) or the spectral density 4,a(s) is known, 
what is ¢s.(r) [or ®s-(s)]? Both source signals a(t) and b(é) are assumed 
to be ergodic. H(z) and G(s) may represent physical elements or hypo= 
thetical linear relations, such as prediction. 

Let h, and g(t) represent the impulse responses of the respective ele» 
ments. In terms of the z transform, it is 


Hey Sher (6-77) 


It is understood that h, vanishes for large negative values of n, and g(t) 
vanishes for large negative values of ¢. The output variable c(t) can be 
expressed as 


c(t) = 2 >, a(mT)hn-mg(t — nT) (6-78) 


Equation (6-78) can be interpreted physically as follows: 2 a(mT')hy 


is the value of the impulse input to G(s) at time nT’. if (2) repressg 
a physical element, it responds only to past and present inputs; t 
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hn»—m vanishes for all values of m > n. If H(z) represents a prediction of 

k sampling intervals, hn» vanishes for all m >n-+k. In either case, 

the summation can be taken for all values of m, from —« to . The 

output c(t) is the sum total of the impulse response of G(s) to all previous 

inputs. As before, the summation in 7 can be taken from — to o, 
By definition of the correlation function, 


tulo) = arf, Oct +7) at (6-79) 


where 7; is a large interval approaching infinity. Let b(t) represent the 
variable which is equal to b(é) inside the interval —T, < ¢ < T, and zero 
elsewhere. Substitution of Eq. (6-78) into (6-79) gives 


deel) = a | -> > 


bi(a(mT)hag(t +7 — nT — mt) dt (6-80) 
Let t! =t— mT. Equation (6-80) can be written as 


hoc (7) = oF le » > bit’ + mT )a(mT )hag(t’ +r- nT) dt’ 


Pes & (6-81) 


In Eq. (6-81), the only factors depending on m are bi(¢ + mT) and a(mT). 
Since b; is nonvanishing in an interval 27), there are, within a tolerance 
of +1,27,/T7 nonvanishing terms in the summation over m. Because of 
the ergodic (or stationary) property of a(t) and b(¢), one has for large 71 


Lo] 


bill’ + mT)a(mT) = 77 dna(—t) (6-82) 


m=— 2 


Nubstituting Eg. (6-82) in Eq. (6-81) results in 


doc(T) = rf. bi: Ppa(—t’)hig’ +7 — nT) dt’ (6-83) 


n=—0 


!quation (6-83) represents composite convolution, summation, and inte- 
uration in the time domain. 


In terms of s-domain variables, it is 


%,.(3) = POOL MO (6-84) 


148 SYNTHESIS OF OPTIMUM CONTROL SYSTEMS 


Case 3. Correlation between Outputs of Continuous Systems with 
Digital Inputs. For the system of Fig. 6-8, if ¢a(n7') or &a,(z) is known, 
what is ¢cc() or Bee(s)? The same assumptions for a(t), b(t), Ga(s), and 
G,(s) are made as in the previous case. 


7 a(nT) G,(s) c(t) 


t) 


i i ing i ionship of a sampled system 
. 6-8. Block diagram illustrating input and output relationsh1 t 
riot 3). (From SS. L. Chang, Statistical Design Theory of Digital Controlled Continu- 
ous Systems, Trans. AIEE, pt. Il, vol. 77, pp. 191-201, 1958.) 


The output variable c(t) can be considered the sum total of impulse 
response of G,(s): 


c(t) = y a(nT)ga(t — nT’) (6-85) 


n=—0 


A similar expression holds for e(t). By definition, 


As . - tT dt 
dee(T) = oF fa 3 - a(nT)b(mT)galt nT) go(t + mT) 
Te Mate. (6-86 


As T, approaches infinity, the time interval in which go(t) has appreciab 
value becomes negligible compared with 71. Equation (6-86) can 
written as 


dee(T) = a i : > a(nT)bi(mT) galt — nT) g(t + 7 — mT’) 
n=—-2 m=— (68 


where bi(nT) = b(mT) in the interval —T1 < mT < T, but zero eb 
where. Let # =t—nT,andm=n+m’. Equation (6-87) becom 
by changing variables ¢ and m to t’ and m’, respectively, 


dbee(t) = or ime ‘S a(nT)bi(nT + m'T) 


=-Oo m=—o galt’)go(t’ ee ge mT) dt’! ( 
Similarly to Eq. (6-88), one derives 
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C-) 


a(nT)bi(nT + mT) = of gar(m’T) (6-89) 


n=—2o 


From Eqs. (6-88) and (6-89), 


dee(T) = ah pe ga(m'T)ga(t’)g(t! + 7 — m’T) dt’ (6-90) 


eo 
m'=— 


Equation (6-90) is the convolution sum and integral of three variables 


da(m'T), ga(—t), and g(t). In terms of s-domain variables, it is equiva- 
lent to 


Da(s) = Fr PE (Ge(—9)G4(8) (6-91) 


6-7. Stationarity of Continuous-system Variables in a Sampled Sys- 
tem. We note that the signals c(t) and e(t) are stationary in the long 
run but nonstationary within each sampling period. For instance, G(s) 
and G,(s) may have an impulse response of the form e~**, with 1/a much 
loss than 7. The mean-square values of c(é) and e(t) immediately after 
(he sampling instant are much larger than those values immediately 
before. The correlation function ¢..(n7) actually could be written 
(nT tr), indicating that the averaging is taken over values of c(t) and 
e(() at exactly ¢; after the sampling instants. Correspondingly, the 
wimpled spectral densities &,-(2,t1) can be defined as 


Bee(Z,ti) = DY beo(mT',tr)2™ 


n=—~% 


‘The mean-square value of e(¢) at ti after the sampling instants can be 
obtained by integrating: 


1 2 
aj P Peles) de 


The correlation function ¢:<(7) of Eq. (6-90) is obtained by integrating 
over all times, and the dependence on ¢; is averaged out. As a result, 
t.() of Eq. (6-91) is the sampled spectral density corresponding to 
averaged ce(7). The mean-square value of e(¢) obtained from &,,(z) is 
(he averaged value over all times. 

6-8, Optimum System with Random Inputs. Figure 6-9 represents 
the block diagram of a system with random inputs. The spectral densi- 


tien of the various inputs are known. For simplicity’s sake, we assume 
that the inputs are uncorrelated. Since the treatment can be readily 
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extended in case this assumption is not valid, there is little to be gained in 
treating a general case. We assume also that the functional constraints 
and power constraints are the same as those of Sec. 6-3. 

In previous sections, we defined a closed-loop digital transfer function 
K(z) such that, with an input r(t), the z transform of the output variable 
is C(z) = K(z)R(z). This function is now denoted as K,(z). We note 
that, with a total input ri(t), mit) = r() + rn), the digital signal 
ex(t) applied to the plant and hold system is given by its z transform, 


K,(z) 
Gz) 


evens o— 


Hold system 
and plant 


Fia. 6-9. Block diagram showing random inputs, system variables, and definition of 
system error in a sampled-data system with inner feedback loop. 


E.(2z) = Ki(z) Ri(z)/Gil). To take the effect of load disturbance w( : 


into account, a similar transfer function K 2(z) is defined in the followi 
equation: 
1 

Ex(2) = az [Ki@) Ril) + K2@)W@)I (6-92 

Gy(Z) 

where W(z) is the z transform of w(?). The functional constraint 
K,(z) is the same as that given by Eq. (6-8). 

The function c(t) is given by its Laplace transform: 


ce) = 29 iKroR® + OW + WO OM 


The system error is given as 
E(s) = L(s)R(s) — C(s) 


The mean-square value of e(t) is 
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a ee 
e= Inj ses ®..(s) ds 
ig eS 
= 955 P Peel) de (6-95) 


The last equality sign is shown in the same wa 
: y as Eq. (6-21). L 
represent the desired output. Then . ) et cx(t) 


Ci "os 
From Eq. (6-94), eae pes 
Peo(S) = .,¢, (8) ae ,.(8) a ,,-(8) ce ®.¢,(8) (6-97) 
By definition of c,(é), we have 
,,-,(s) = L(s)L(—s)®,,(s) (6-98) 


[Equations (6-84) and (6-93) give 


_ 1 K%(8)G,(s) 1 K¥(s)Gi(s)L(—8) 


®.,c(8 7 Ge Or 768) = T Ge (s) ®,,(s) (6-99) 


I'rom Eqs. (6-91) and (6-93), 
1 Gi(s)Gi(— 
als) = HGP ENE ay (KTOKY-O82O) + HO] 
K3(s)Gi(8)Pww(s) , KF(—s)Gi(—s)Pvw(s) 
TG#(s) TG}(—s) 
ai Pw(8) (6-100) 
rom Eqs. (6-97) to (6-100) we obtain finally the following z-transform 
equation: 
D,.(z) Ls LL®,,(z) can * Eile eteele ro Bag eae 
1 GGi2) 
I Ge {Ki(z)Ki(2)[®,-(2) + ®nn(z)] + K2(2)K2(2) Bww(Z) } 
K2(2)Gs®ww(2-}) K(2-)G®ww(2) 
TGi(z) PGE) 


+ K%(s)K3(—s)%,,(s)} + + 


te + Byu(2) (6-101) 


Nimilarly, the mean-square value of the constrained variable a(t) is shown 


to be 


(6-102) 
where 


i GG; (z) 


Be) = Maier) (ROK bol) + Pan(e)l 


+ Ko(z)K2(2-")Pww(z)} (6-103) 
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The optimum system functions K1(z) and K2(z) can be determined by a 
method similar to that used in Sec. 6-4. However, as there is more 
than one unknown function, it is desirable to make use of a variational 
theorem in the zg domain. Let the integrals J, be defined as 


1 ap =. a 

i. ‘= aaj P Pm eal, ae Ss ee e 
MeO too... .., 0) (G=1049 
where F(z) = F;(z-!). Conditions similar to 1, 2, and 3 of Sec. 4-3 are 
assumed to be satisfied. The necessary as well as sufficient condition for 


M 
J = y daln 


to be stationary with respect to infinitesimal variations of Fi, Fs, .. . 
and corresponding variations of Fi, Fe, ... is that the functional 


derivatives 
M 
X%=r », Xm oom (6-105) 
OF; 
m=0 


do not have any pole inside the unit circle. 
Making use of the above theorem and following through the steps of 
Sec. 6-4, we obtain finally 


. if GiL®,,(z) 
Ki) = Fw Es me |, 


i Po 1 GP wy» (2) 
Kx@) = — Wee) laren ee |, 
where Z(2)Z(2-?) = Be (z) + Pan(Z) (6-108) 


All the poles and zeros of Z(z) are inside the unit circle. Y1(z) is defined 
by Eq. (6-46). From K,(z) and K2(z), Di(z) and D2(z) can be readily 
calculated. 

The following example illustrates some typical calculations. 


Example 6-2. For the system of Fig. 6-9, the known conditions are 


1 — e* 10 
GS) “ea 
L(s) = eT 
; 2vA2 
Prr(jo) = tia 
Prn(jw) = 0 


dai(r) = 1 r=(0 
dun(r) = 0 T = Zz 
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To design a data-processing unit which gives the lowest mean-square error, without 
constraint, we calculate 


_ 3.94241 
Gi) = 70 6061 
9.77 (1 + 0.2642) (1 + 0.2642-1) 
Gg = 
1G, (2) (1 — 0.6062) (1 — 0.6062—) 
dy5(2) = 1 
2 am Ae 
®,,(z) = tk qz=er 


(92) =e) 
ql — az)(1 — az~}) 
a(1 — gz)(1 — gz) 
where a is a constant less than unity and satisfies the following equation: 


A*(l — g*) _ @ —a)(1 — ag) 
q aq 
PE 
: m 
Ye) = 2.5 NES (1 + 0.2642-!) (1 — az) 
20vA2(1 — e7*)enTs 
(v? — s*)(1 — 27's)(—s) 
4 10a = Oe 
@+a)d +27), 1 °° 
A 10A°(1 — q)qr 
2 GL4,,,(z) — vl a 2Tr) (1 a + . 
[ GiL®,r(z) ] af 10A*(1 — q)q" 1 
Giz) Vile )Z(z) J, oo 4 270) [Z(2) Y(2)4(2)]-724 1 —qzt 
¥ is 1.264Cq"(q — a)z7} 
° (+ 0.2642-4) (1 — az) 
= Caw ics 1.264 1.606q 
Trq(1 + 27) 1 + 0.264q 0.394 


6-9. Optimization of Transient Response to Unsynchronized Deter- 


ministic Inputs. Unsynchronized deterministic inputs can be treated as 
tundom inputs with spectral density 


®,,(s) = AR(s)R(—s) (6-109) 
where A is a constant. Equation (6-109) follows directly from Eq. 
(4635). 

6-10. Optimization of the Presampling Filter and Wave-shaping 
Vilter.t Figure 6-10 illustrates a system for transmitting a continuous 
“ignal over a sampled-data link. The sampling switch generally repre- 
sents the necessity of transmitting the signal as a sequence of numbers, 
rather than representing an actual switch. For instance, we may wish to 
transmit & continuous signal accurately by short wave, and the most 
teliable way of doing this is pulse-code modulation. The sampling opera- 


| Parts of Bec. 6-10 are reprinted from a paper by the author, Optimum Transmis- 
won of Continuous Signal over a Sampled-data Link, ATER paper 60-1243, 1960. 


Z(z)Z (271) = $,,(z) + Pnn(Z) = 


=1 
‘=] 


G(—s)L(s)®rr(s) = 


ee 
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| 

} 
| tion, which is part and parcel of pulse-code modulation, is rather undesira- 
} ble in that it causes frequency aliasing and mixes high-frequency noise 
| into the signal band. This effect can be substantially reduced by using a 
low-pass filter F(s) before sampling to filter out most of the high-frequency 
noise. At the receiving end, a wave-shaping filter G(s) is used to recon- 
struct the continuous signal from the sampled signal. 

The optimization problem can be formulated with reference to Fig. 
6-10. The input signal is ri(t) = r(é) + n(t). The desired output signal 
ci(t) is the output of a linear operation L(s) on r(é). Given the spectral 
densities ®,,,,(jw), the network functions F(s) and G(s) to give least 
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Sampling 


Fria. 6-10. Block diagram of a sampled-data transmission link. (From S. S. L. Chang, 
Optimum Transmission of Continuous Signal over a Sampled-data Link, AIEE paper 
60-1243, 1960.) 


[e()F and the minimum value of [e()? are to be determined for the follow- 
ing three cases: 

1. F(s) is fixed, and G(s) is to be optimized. 

2. G(s) is fixed, and F(s) is to be optimized. 

3. F(s) and G(s) are to be optimized simultaneously. 

In the present and next sections, our mathematical manipulations are 
done mostly in the s domain, and we shall use the same shorthand nota- 
tions explained in Sec. 2-4 following Eq. (2-13). Whenever sampling 
is involved, AB* = A(s)B*(s), meaning that B(s) alone is sampled, and 
(AB)* = [A(s)B(s)]*, meaning that A(s)B(s) are sampled together. We 
further define the following mathematical symbols: 


PR ge et Sea 


The function inside the braces is always symmetrical in s and —s and eft 
be factored so that { }+ and its reciprocal are analytic in the right ha 
of the s plane and {  }~ and its reciprocal are analytic in the left half 
the s plane. 

{ }*£: The same definition is applied to sampled functions. { 
is a rational function of z. Let \ and 1/A be a pair of its poles (or er 
with |A| <|. Then { }** is written in factors of the form 1 = 
and {| }*= is written in factors of the form | — dz. 
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These rules are the same as the corresponding ones in Secs. 2-4 and 6-4 
defining Y(s) and Y(z), respectively. 
With reference to Fig. 6-10, the mean-square error is given as 


ly Wye 
[e(t)|? = ag eae ®,.(s) ds (6-110) 
where ®re = Pec, — Pac — Pec, + Pec, and the latter spectral densities are 
given as 
&,,-, = LL4,, (6-111) 
1 ees 
Dec = T G®,,i = p PGL®,., (6-1 12) 
ee 
Pee, ae T GL®,,, (6-113) 
i Mee _ 
Bee = T (FF®,,,) *aG (6-114) 


Equations (6-111), (6-112), and (6-114) are obtained from Eqs. (3-17), 
(6-84), and (6-91), respectively. Equation (6-113) is obtained by chang- 
ing s to —s in Eq. (6-112) and noting that %,,.(s) = ®cc,(—S). 

The only requirement on F and G is that they be analytic in the RHP. 
With 2 fixed, Eq. (4-37) gives a condition to be satisfied by the opti- 
mum 3 


Oba fo lig a ., 
p Plane + FF Onn) = Xs 


=, (6-115) 


where X,(s) is an unspecified function analytic inthe LHP. The solution 
of Eq. (6-115) is readily seen to be 


Oe Q 


(FF@,,,,}*+ ame 
FL® 
where Owe | £ 
Pree =i eh 


To derive an expression for the mean-square error under this condition, 
we obtain from Eq. (6-115) 


of, Se ee Ae alee: 
nj io ce, c)ts = — ony a iG as — 0) (6-118) 


The last equality sign is due to the fact that both X, and G are analytic in 
the LHP. Replacing the dummy variable s by —s on the left-hand side 
of Iq. (6-118) gives 


a i * @ae = a) ds = 0 (6-119) 


2nj J -i 


4i 
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Substituting Eqs. (6-118) and (6-119) in (6-110) gives 
Pe es 1 QQ 


je 
le(t) Gon = oy ee (Beye, — Bee) ds = Oni | " (Pa _ +) ds (6-120) 


The above completes the equations for optimizing G(s). The problem 
of optimizing F'(s) with G(s) given is reduced immediately to the same as 
the previous one by proving the following equation: 


Bey ae de eS a -121 
Omi ‘os A*(s)B(s) ds = Onj sue B*(s) A(s) ds (6-121) 
The above equality is to hold for all even functions A(s) and B(s) with 


finite a(nt) and b(nt), respectively. 
Equation (6-121) is readily proved as follows: 


oj i A*(s)B(s) ds = oa - A*(s)B(—s) ds 
= ty b(t) » a(t)é(t — nT) dt 
= , a(nT)b(nT) (6-122) 


The first equality sign is due to the fact that B(s) is even; the second is 
due to Parseval’s theorem. Since the right-hand side of Eq. (6-122) is 
symmetrical in a(t) and b(t), Eq. (6-121) is proved. 

With the help of Eq. (6-121), Eq. (6-110) can b2 written as 


le(t)]? = : | “ (® — Bo, — Baye + Peie:) AS (6-123) 


Qnj —jo 


where e. - (GQ)*FF®,,., (6-124) 


In optimizing F(s), Eqs. (6-123) and (6-124) take the place of Eqs. (6-110) 

and (6-114). The same steps which led to Eqs. (6-115), (6-116), and 
(6-120) give 

— OLA + FGA)" ank = Xs (6-126 

jay ast 

{GG} *+ {Br }* 


Te - id P {5 (0 - f ov) ds 
=je 
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where X» is an unspecified function analytic in the LHP and U is given as 


GL® 
Oe mee] (6-128) 
bogie i 


To summarize the above results, Eqs. (6-116) and (6-120) give the 
optimum wave-shaping filter G(s) and the least-mean-square error, 
respectively, if F(s) is fixed. Equations (6-126) and (6-127) give the 
optimum presampling filter /(s) and the least-mean-square error, respec- 
tively, if G(s) is fixed. For the optimum transmission system, Eqs. 
(6-115) and (6-125) are to be solved simultaneously. While all four equa- 
tions (6-116), (6-120), (6-128), and (6-129) are satisfied by the optimized 
functions F(s) and G(s), they do not represent a solution of the problem 
since both F(s) and G(s) are unknown. 

In order to arrive at a solution of Eqs. (6-115) and (6-125) simul- 
taneously, one more general relation will be derived: Multiplying Eq. 
(6-115) by TG(—s) and Eq. (6-125) by TF(—s) and taking the difference 
give 

{FF®,,,,}*GG — {GG}*#,,,.FF = TXiG — TX.F (6-129) 

The expression on the right-hand side of Eq. (6-129) is analytic in the 
LHP. Therefore the expression on the left-hand side is analytic in the 
LHP. As the latter expression is symmetrical in s and —s, it is neces- 
sarily analytic in the RHP also. Therefore, it must be a constant. The 
constant can be determined by integrating the latter expression from 


-jo to jo. Equation (6-121) gives the integrated result as zero. 
Therefore, the constant is zero, and it follows that 


{FF®,,,,}*GG = {GG}*®,,,FF (6-130) 


I'quation (6-130) is to be satisfied by the simultaneously optimized F(s) 
and G(s). Substituting Eqs. (6-116) and (6-126) in Eq. (6-130) gives 


QQ = UU (6-131) 


ecause of the requirement that all the poles and zeros of Q(s) and U(s) 
be in the LHP, Eq. (6-131) gives 


Q(s) = U(s) (6-132) 
!quation (6-116) is multiplied by its conjugate to give 


{FF ®,,,,}* 


‘The sampled form of the above equation is 


(GG\*{FFH,,)* = (QQ\* (6-133) 


a 
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On account of Eqs. (6-132) and (6-133), Eq. (6-126) can be written as 
oe Q 
{FF®,,,} ad 


{QQ} {Brn dH 
Equation (6-117) becomes 


Aa QL, 

{QQ} *-{®nr}7 oe 

Similarly, by using Eqs. (6-132), (6-133), and (6-116), Eq. (6-128) is 
reduced to exactly Eq. (6-134). 


Equation (6-134) can be solved for Q. The only poles of the bracketed 
expression in the LHP are the ones due to L4,,-. Let these be denoted as 


vi. 
A; 
yes 


The coefficients A; are determined by substituting Eq. (6-135) back into 
Eq. (6-134). Equation (6-133) gives 


(GG) *+{FFS,,,)** = {QQ}** 


(6-134) 


(6-135) 


(6-136) 


However, there is no way of determining the two factors on the left-hand 
side of the above equation separately. This is not surprising, since if 
we write F = F*F, and G = G*G@ the block diagram of Fig. 6-10 gives 


C = (RF)*G = (RF2)*F{GiG2 


Therefore, C(s) is indifferent to the distribution of F*(s)Gf(s) as long as 
the product remains unchanged, and consequently one is at liberty to 
choose one of these functions. As the sampled network function is much 
more costly to realize before sampling than after, one requires that 

Ce ae 


where B is a constant to be selected as one wishes. Equations (6-116) 
and (6-126) become 


BQ 


= 6-138 
{QQ} ** i? 
Q 


F = >= 
B {Prins a4 
Equations (6-137) and (6-138) give the simultaneously optimized netw 


functions, while with the optimized Q(s) Eq. (6-120) gives the least va 
of mean-square error, 
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Example 6-3. A problem of optimum filtering and prediction in the presence of 
uncorrelated white noise will be used to illustrate the method. 


ei(t) = r(t + a) L(s) = e 
2a 
@,,,-(s) = ae aaa? 
_ 2a o2(b2 — 8? 

$r,7,(8) ad Guicust +o = “ a? > 
where b? = a? + 2a/c%. Determine optimum f(s), G(s), and the extra error due to 
sampling. 

Solution. Equation (6-135) gives 
A 
We) <7 ae 


The braced functions in Eqs. (6-137) to (6-139) are calculated to be 
A*(1 — e247) /2a 
a ae e79Te-Ts) el ae e787 e?*) 


(Q(-2)Q(0)}* = AVG =e #7) /2a 


1- eT eTs 


{Q(—s)Q(s)}* = 


_ . o(0 ='8) 
{Snr (s)J7 = tC aeeiatt 


Substituting the above results in Eq. (6-134) gives 


eg ee 2a(1 — e7**7) 
o(b +a) 
For numerical simplicity, let 
B= e-%@ V/ (1 — e-%7) /2a 
quations (6-138) and (6-139) give 


G(s) = aa Disiel Me Ry 


pe (6-140) 
& 2a s+a 2a 1 
F(s) = 
cb +a)(st+ayc(s+b) o(b+a)s+b 
Since o? = 2a/(b? — a?), the above equation can be written as 
b-—a 
F(s) = ape | (6-141) 


The function G(s) of Eq. (6-140) can be separated into a digital filter and a low-pass 


oye er with F(s) of Eq. (6-141), they are represented by the block diagram 
OF Pig. O-L1. 


G(s) 
F(s) 
r(t)+n(t) b-a — aa 
a = -aT _-1 
ery l-e“"z a c(t) 
Prefilter Digital filter Wave-shaping 


filter 
ng My 1. Block diagram for the optimized sampled-data transmission link of Exam- 
MO Ded, 
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The mean-square error is given by Eq. (6-120): 


rin oe ee tee ee 
le@? =1— aon = ob + aT 
‘e © (b = a)(1 24 e247) g—2a0 
sob 2aT(b + a) oe 


At T approaches zero, Eq. (6-142) approaches the known result of a Wiener filter: 


_—— poe 
[ela = 1 —-ppge™ 
The excess error due to sampling is found to be 
—_—- +» -—~; —©| we Ne ) (b — a)e~2a2 
POF - hOB = (1-—Sap—-) “Fa (6-143) 


6-11. Optimum Presampling Filter and Hold Circuit in Control 
Systems. The method of the preceding section can be modified slightly 
to optimize the presampling filter, the digital processing unit, and the 
hold circuit simultaneously in a sampled-data feedback control system. 
Sometimes a designer is not at liberty to change the presampling filter 
or the hold circuit. This presents no problem. As we shall see, the 
method can be used to optimize the digital processing unit together with 
either or both of the other two units. 

The essential difference between optimum filtering and control prob- 
lems is that in the latter the constraints due to the controlled plant must 
be considered. In general, we can find equivalent closed-loop transfer 
functions F(s) and G(s) such that 


C(s) = [F(s)Rx(s)]*G(s) (6-144) 


where G(s) or F(s) is usually a product of a sampled function and an 
unsampled function of s. While the method we shall discuss is not based 
on any particular arrangement of control components or block diagram, 
it is desirable to use one to illustrate the physical significance of F(s) 
and G(s). For the system of Fig. 6-12, Fi, Gi, Gp, and H are unsampled 
functions of s, and D* is the usual digital processing unit. 


E, = FiR,; — E}D*GF1 (6-145) 
where G= ae (6-146) 
Applying the sampling operation to Eq. (6-145) and solving for 2} give 
| iit we 
1+ (G.F1)*D* 


G,(FiRy)*D* 
T+ Gky*D* 7 


C = GE} = 
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Fie. 6-12. Block di i i 
shepeie tite “e ot of a feedback system with presampling filter F1(s) and wave- 


Comparing Eqs. (6-144) and (6-147), we see that 


F(s) = Fi(s)B*(s) (6-148a) 
G(s) = as 
[1 + (GF) *D*]B* (6-148b) 


where B* is arbitrary. If the same reasoning which leads to Eq. (4-10) 
is applied to Eq. (6-148b), it is obvious that the transportation lag and 
nonminimum-phase part [e-*P(s)] of G,(s) must also be a part of G(s) 
Therefore we write 

G(s) = e-*P(s)G2(s) (6-149) 


where G2(s) is required to be analytic in the RHP but is otherwise free. 
The amplitude constraint can be written as 


la)? <M 


phere A(s) = G;(s)C(s)/@,(s). Among the spectral densities given in 
Iiqs. (6-111) to (6-114), the significant modifications and additions are 


ins TPG Pele, (6-150) 
Bee = Fr PPE) PPCM: (6-151) 
0, = 1 Pin) PROGLE, age 
‘The integral to be minimized is 
(OP + MOE = ots [Cent WP) ds (6-158) 


l’ollowing the same steps as in Sec. 6-10, we obtain the optimized func- 
(ions for the following cases: 
| 1, Prefilter F(s) is known; D(z) and G(s) are to be optimized. In 
(his case we let B*(s)=1. F(s) is then completely known. The 


ee [i ; y Per'L®,,, A. 6-154 
Y {FF®,,,)** Y {FF®,,.,)*- i ( = 


optimized G(s) is given as 


—————<<_<— TTC 
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| P kG:Gs)* (6-155) 
where Y= ae ae GG, 


In other words, 1/Y is obtained by a root-square-locus plot of the transfer 
function G,/kG3, keeping the original zeros as its zeros. D@) and G(s) 
are obtained by comparing Eqs. (6-148b) and (6-154). Equation (6-120) 
is still correct for calculating e + k%a® with the new value of Q given as 


ties Pee L® yy | 
beatae ar 
2. The continuous part G(s) is known; D(z) and F;(s) are to be opti- ~ 
mized. Let 


(6-156) 


D* 
“1+ Grob 


Then G(s) = G(s), and the optimized F(s) is given as 


B* (6-157) 


(dt aS (6-158) 
} {Syr,}+{Y YEG) *+ 


Ge L® yr 


“ (6-159) 
0 leap ioar- |: 


where 


and Eq. (6-127) is still correct for calculating ¢ + ka? with the new U~. 
3. All three functions F1(s), D(z), and G(s) are to be optimized 
simultaneously. ‘The results are 


ear e, ae | (6-160) 
faa Recuszcer + 
_ Be (6-161) 
Y{QQ}** 
ee” 6-162 
# ne Bi ®y2,}7 ¢ 


where B is an arbitrary constant. 


Example 6-4. A follow-up control system has the same signal and noise spectral 
densities as the one in Example 6-3. The controlled plant has a transfer function 
K/s, and the mean-square value of the input i(t) to the controlled plant is limited; 


fa? <M 


Determine the optimum F'(s), D(z), and G(s). 
Solutions. Under the given conditions, 
L(s) =1 G3(s) =1 
Yalt+peni+hs 


ev? = 1 
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Equation (6-160) gives 
ou V 2a(1 — e247) 1 
~ o(6 +a) + ak’)s +a 


Let B = JV (i — e-**7)/2a as before; then 
1.= e¢Te-Ts 


Gs) = G+¥H6 Fa) (6-163) 
re b-—a 
FG)’ = Ga LD (6-164) 


Equations (6-163) and (6-164) can be readily realized by the block diagram of Fig. 
6-13. We note that the optimum data hold has an exponentially decaying character- 
istic instead of the conventional constant value over a sampling period. 


Low-pass AG 
filter Hold circuit 


Fig. 6-13. Physical realization of an optimum system (Example 6-4). 


Amplifier Plant 


6-12. Summary. A few selected topics on the optimum design of 
sampled-data systems are treated in this chapter: 


1. Deadbeat systems 

2. Systems with least-integral-square error to deterministic inputs 
subject to plant saturation 

3. Systems with least-mean-square error in the presence of noise and 
load disturbance and also subject to plant saturation 


The emphasis is on the design of a digital computer or sampled-data 
processing unit which receives and delivers a series of impulses to control 
4. continuous plant. The manipulated impulses are such that they give 
the best match, in a prescribed sense, of the continuous-plant response to 
(he desired response. In cases where the designer has a free choice of the 
presampling filter and (or) the hold circuit (postsampling wave-shaping 
filter), methods are also developed for simultaneous optimization of the 
wimpled-data processing unit together with either or both of the con- 
(inuous filters. 

Various ways of defining the sampled correlation function and sampled 
spectral densities are given. The equivalence as well as differences of 
(hese definitions is shown. The input-output relations of sampled ran- 
dom signals as well as continuous random signals in sampled systems are 
derived. A variational theorem in the z domain is stated but not proved, 
an this can be done readily by a method similar to that of Sec. 4-3. 

Uxamples are given to illustrate the various design techniques. 


CHAPTER 7 


SPECTRAL-DENSITY ESTIMATES 


In the previous chapters, we have visualized the significant role played 
by spectral densities in the design of optimum systems with random 
inputs. Little has been said about the accuracies of experimentally 
determined spectral densities. For stationary signals with unlimited 
data available, the spectral densities can be determined to any degree of 
accuracy, but this condition does not exist in many situations. Just to 
name a few: 

1. In the experimental study of missiles in flight, every bit of data is 
obtained at considerable cost. 

2. For adaptive systems, admitting random inputs, the input signal is 
assumed to be undergoing adiabatic change. In order that the system be 
constantly tuned to give optimum performance, we must be able to deter- 
mine the most up-to-the-second signal properties with the maximum 
accuracy. The requirement of up-to-the-second-ness and the require- 
ment of accuracy are not compatible. 

3. Even for stationary random signals of unlimited availability, 
economic considerations of computer and manpower usually limit the 
amount of data which are actually analyzed. 

Considering the above, we see that some basic knowledge of spectral+ 
density estimates is of vital importance in control engineering. 

7-1. The Nature of Measured Spectral Densities and Correlation 
Functions. In Sec. 3-5, the spectral densities and correlation functions 


are defined in terms of random variables which remain stationary and. 


may be measured for as long a period 7; as we wish. However, in all 
practical measurements, the duration 7, is finite. Because of the rane 
dom nature of the variables, no two measurements are expected to agree, 
The spectral densities as computed from one sample record differ from 
that computed from another, and the results are expected to be scattered 
somewhat rather than falling exactly on a smooth curve. In Fig. 71, 
the solid curve represents the mean value of measured spectral density 
from a large number of sample records, while the broken curve represet 
that from a single sample. The same situation exists with meast 
correlation functions. 
The scattered nature of the measured spectral density does not ork 
14 
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nate from the measuring technique or the instrumental accuracy. We 
may think of it as nature’s way of baffling us when we try to squeeze too 
much information out of a single specimen which does not have that much 
in it. More specifically, there are two main sources of uncertainties: 

Uncertainty of Frequency and Time. In order to determine the fre- 
quency of a signal to an approximate accuracy of Af, the length of the 
sample signal required in the analysis is at least 1/Af. 

Statistical Uncertainty. There is an inherent uncertainty in the process 
of estimating the properties of the entire ensemble from a limited number 
of observed samples. For instance, we have to throw a die many more 
times than the least possible six to know that there are six faces on the die 


P,, (Jw) 


Fira. 7-1. Typical random variation of power spectrum computed from a single sample 
record of finite duration (broken curve) from the averaged power spectrum computed 
from a large number of sample records (solid curve). 


and that each face has the same probability of appearing, if this knowl- 
edge is to be gained entirely by experiment. 

In the subsequent sections, we shall determine the inherent limitations 
on the accuracies of computed spectral densities and correlation functions. 
‘These limits depend on the number and length of sample records, required 
resolution in frequency, and the random nature of the signals being meas- 
ured. For the latter characterization, we shall assume that the signals 
are Gaussian. As discussed in Sec. 3-2, a Gaussian signal can be thought 
of as the sum total of a large number of independent contributing signals, 
(he magnitude of each contributing signal being insignificantly small. 


ln nature, the following signals are among the notably Gaussian ones: 
|, Pressure and height of ocean waves 
Pee ; F - 
2. Pressure and instantaneous velocity of air turbulence 
‘ ry . . . 
4, Thermal noise, shot noise, radar noise, ete, 


ghee ernie 
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One may think of the Gaussian case as an extreme example in random- 
ness, and reliability estimates based on Gaussian signals are pessimistic 
for signals that may be random but not Gaussian. 

7-2. Error Due to Uncertainty in Spectral-density Measurements. 
Let us consider a sample signal z(t), of length 71, generated by a given 
stationary random process. It can be expressed as a Fourier series: 


i® = cs 7 > (Ae 008 cot + By sin wf) (7-1) 
n=1 
where wn, = 2an/T; and 
T1 

ae 7 ve {(d).008 eet dt (7-2) 

1.70 

2 ; 

Ba == a(t) sin wat dt (7-8) 

T1 Jo 


We shall refer to the mean-square value of signal amplitude as power. 
The power contained in the nth spectral line is 


P, = (A,? + B,?) (7-4) 

For a stationary random process, one instant of time is just as good as 

another, and there is no preferred phase for any sinusoidal component of 

a(t). Therefore 

(An) = (Bn) = 0 (7-5) 

and (An?) = (Br?) (7-6) 

The Fourier coefficients A, and B, have zero mean, but their standard 

deviation is, in general, not zero. These coefficients have the same 

nature as the net distance traveled in a random-walk problem with equal 

probability of moving in either direction. The analogy can be better 
demonstrated by rewriting Eq. (7-2) as follows: 


m—1)T2 


M 
A,T,=2 ) | Sie i(t) COS wnt dt (7-7) 
m=1 


where M = 7:/T>. In Eq. (7-7) the integral from 0 to T, is broken up 
into little segments of width T2 each, and T» is selected so that the 7(¢) in 
different integrals are substantially independent. Since each integral 
has equal likelihood of being positive or negative, the expected value of 
the total sum A,7'1 vanishes, which is in accord with Kq. (7-5). For the 
same 7's the mean-square deviation ((71A,)?) increases proportionally 
with M and therefore with 7: 


((T1An)*) = T1XA,?) = K(on)T1 


where K(w,) is a proportionality constant independent of 7, but is a fune 
tion of wn. 
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We may write the above equation in another way: 
K (on) 
2 a. n 
(4a) = ae (7-8) 


What is the significance of K(w,)? With a sample signal of length T 
the spectral lines or Fourier components are at frequency Ae Ne» 
1/ Ti apart. For any given frequency range Af, the number of spectral 
lines inside that range is 7; Af, and the total power inside that range is the 
measured value of 4;:(jw,) Af, by definition.t Therefore, 


si( jon) meas Af = » 44(A,? Sf B,,”) (7-9) 


where the summation is over all spectral lines inside Af. The spectral 
density ®;:(jwn) is the expected value of the measured ;;(jw,). Conse- 
quently, Eq. (7-9) can be written as 


Pii(Jon) Af = » 48((An?) + (Br?)) (7-10) 
From Eqs. (7-6), (7-8), and (7-10), we have 
K (cn) 


®;;( Jon) Af = Ts Af f 
ry : 
Cherefore, K(@n) = ii(jon) (7-11) 


In the above we have shown that A, has zero mean and standard 
deviation ~/%;(jo,)/T;. What is the distribution of A,? With refer- 
ence to Sec. A-6, if 71 is sufficiently long or the number of terms M in Eq 
(7-7) is sufficiently large, A, is expected to have Gaussian distributions. 


p(A,) dA, = 


e-Ant/2En dA, (7-12) 


1 
V/2nE, 


where EF, = Pii(Jon)/ Tr. From Eqs. (7-4) and (7-12), the power of the 
nth spectral line P, has a probability density p(P,): 


p(P,) dP, = / Maga) ad de. 


1 
= opi / ePol Bs dA, dB, 
1 
dn (7-13) 


lhe integral sign is over a zonal region P, < 14(A,2 + B,2) < P, + dP 

i" " "J « ei 

l'rom Eq. (7-13) we see that the power of each spectral line has an 
| To agree with most of the literature on power spectra measurements, ®;4(jw) is 


taken to be the power per unit real frequency range in Chap.7. Its value is twice that 
of the theoretical (jo) in other chapters of the book. See See. 3-6. 
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exponential distribution. Its value could be anywhere from zero toa few 
times E,. The expected value of P, is E,,, and its mean-square deviation 
is 


(Px — Ex)*) = f° (Px — Ea)*(Px) Pn = Ent (7-14) 


Equation (7-14) shows that, if we measure the spectral density ®ii(jon) 
by calculating P, for one frequency alone, the probable error is 100 per 
cent. However, if we sum over P, for all the components in a frequency 
range Af, as shown in Eq. (7-9), both the mean value and mean-square 
deviation are multiplied by the number of components 7, 4f. The 
percentage of the probable error is reduced: 


/ (EG) meas — OGo)P?) _ 1 
B(jw) /T; Of 


where ¢ is the expected per-unit error. 

The same sort of reduction in probable error can be obtained by 
averaging the spectral densities analyzed from N independent samples of 
a(t); Eq. (7-15) is replaced by 


(7-15) 


Ill 


1 
ee 7-16 
©" NT AS 7 
Alternatively, we may write 
1 


il e@N Af 


(7-17) 
Equation (7-17) gives the required sample length for an expected per-uni 
error «and resolution Af. We note that the same degree of accuracy and 
resolution can be obtained by either increasing the number of samples 0 
increasing the length of each sample. However, in the former case, 
resolution is always limited by the following relation: 


1 
Af 
Equation (7-18) states simply the impossibility of distinguishing two f 
quency components Af apart in a sample of length less than 1 /Af.t 

7-3. Direct Measurement of Spectral Densities by Analog Meth 
It is generally true in electronic computing devices that, if the desi 
accuracy is not exacting, analog methods are cheaper or simpler t 
digital methods. The dividing line is approximately at an allowed er 
figure of 0.1 per cent. In spectral-analysis work, instrument error of 
order of 1 or 2 per cent is considered tolerable, since the inherent error d 


Se (7-18 


} Inequality (7-18) can be put in a more precise form: Ef the signal i(t) of length: 
is repeated indefinitely, the power spectrum becomes bunched into spectral 
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to statistical inties i 
ib sai igri is of the order of 10 per cent or more in most 
The basie arrangements of Fig. 3-2 i iti 
and cross-spectral densities bied atelier Se eet eee 
there have been various methods published in the iterations th es 
more or less variations of the basic method in order to 0 egies 
another of the following difficulties: co eee 
* v Servo signals usually have very low frequencies. Some have periods 
. nik sirote / parca than in seconds. The filters are costly, and 
; akes to analyze a point on the spectrum is prohibitivel 1 
2. A tunable filter or a large number of band al i Se 
be used in order to obtain sufficient poin oe fa =, 
versus-frequency curve. It is difficult ue poss a age ret 
versus frequency curve the same (or even constant) at differ ye 
quencies. Scale corrections are neither convenient nor Rn she 
a be heeniaes cross-spectral densities the bandpass filters sae to 
. e ; y matched in phase-shift characteristics. This is very difficult 
o do for filters with narrow passband and sharp cutoff characteristi 
A workable device must overcome all the above difficulties, F mer 
speed-up process, the signal is generally recorded on magnetio rte : 
sound film and played back at much higher speeds. Magnehie tale 


separated from each other b ini 

y 1/T;, and the minimum uncertainty i 

1, an ty in f i 
the order of 1/7. If the signal i(t) is not repeated, we may ake maa A. 


i®) =7, Oh) 
mo = {4 for 0 <t < 7; 
0 for other values of (t) 


< 1] 2 ; 
I(jw) = xf. To(jo — jo1)H(jo:) dor 


Then 
Multiplying the above equation by 2/7; times its conjugate gives 


we I( joo) (jo) = sar, {— 4 SoA 
Jo) = gap, J, |, Le (Jwn — jo)Te(jo — joer) H (jer) H (Joon) deo doos 


Taking the ensemble average gives 


(Dai ( I meas} = Pra = ef i - . . 
Jah 20? Ty hi %0 }. co (Ta(jo2 — jw) To(jo — jo1))H (joi) H (—Jjws) dar dws 


A ‘ Peake Mes ; 
0 (1, (jw) 1.(Jo")) = r¥ji(jw)6(w — w'), the above equation becomes 
(Bi: j meas) = sd & a i (J j j 
(Je) meas) eT, | — am PHO — Jor)H (jor) H(—Jor) don 
1 La i 
se aire : sin w,7',/2\? 
rT i, Sale = sa) ey ) oe 
lhe above equation represents a spreading of the spectral density over a bandwidth 


of bf = 1/7. In both cases the | D i i i 
Oo Piel Red sat ph e best resolution obtainable is 6f. Therefore Af > of, 
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has proved more convenient to use than sound film. ane. 
necessity of using tunable filters or a large number of stoi e re : wt rh 
is the only method that has appeared in the literature. An alte aie 
method for the direct measurement of cross-spectral densities is t a ¢ 
the difference between the power spectra of sum and sort pari * 
Instead of surveying the literature on this subject, we sha ao 
| some detail an analyzer initially developed by the author se ms fe 
University and later completed under the supervision of J : : —_ 
of the Marine Division of Sperry Gyroscope Company. t is cu a 
in use for analyzing ocean waves, ship motions, and servo signa 


general. 


soe amnoduiauon'| > aaa 
. i Low-speed High-speed emodulati signal 
| a Aig recording -F— playback ithe 
modulator mechanism mechanism ilter 


| Speed-up 
signal 


Bandpass 
filter 
(f,) 


XY 


Square-law 
recorder 


detector 


Scanning- 
frequency 
oscillator 


D-c signal 
proportional to f-h 


Audio-signal 
frequency f 


Fic. 7-2. Block diagram of electronic spectrum analyzer. 


Figure 7-2 illustrates a block diagram of the device. There are two 


essential functioning units: 
1. Data Speed-up and Repeating Unit. 

spectral analysis is at the high x and supersonic region. 
ing- ion can be done in two ways. ; 
eM Re aacaee Magnetic tapes are much more ca in — 
ing frequency information than straight amplitude Te a oe a 
quency-modulated signal is recorded on the tape, and os i a 
for each sample record is made into a loop and playe “A he 
hundred to a few thousand times the recording speed. For Me 
applications the recording speed and center carrier pain aa 
tional to signal frequency range, while the playback speed 1s 

i spectral analyzer. - 

be ne svuaibeen is accomplished by varying o pon ‘ 
bias of a multivibrator according to the servo signal. if ze ‘ 
positive bias, a linear range of frequency swing of +40 per cen 


The best frequency range for 
The speed= 
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center carrier frequency is readily obtained.t The carrier frequency can 
be made as low as desired in so far as the electronics is concerned. How- 
ever, it must be high enough for the servo signal. The recording-tape 
speed depends on the carrier frequency. It is normally possible to record 
1,000 cycles of carrier on every inch of tape. The playback speed is 
limited by tape flutter and is much higher. Two typical sets of values 
are given in Table 7-1. 


TABLE 7-1 


Dervorsignal:eh,.cty. ose ee 0-0.5 cps 0-10 cps 
Carrier frequency.............. 12.5 cps 250 cps 
Recording-tape speed........... 11g in./min 34 in./sec 
Frequency swing............... 7.5-17.5 cps 150-350 cps 
PlayDaem epeeds sss. 4.2 4044 one oe 45 in./sec 45 in. /sec 
Speed=ip signal eo. 0-1.2 ke 0-1.2 ke 


With large-percentage frequency swing, the demodulation is best 
accomplished by a counter-type demodulator. For each zero crossing of 
the carrier signal, a voltage pulse of constant magnitude and duration is 
produced at the output end. A low-pass filter with a cutoff frequency 
between the highest signal frequency and the lowest carrier frequency is 
used to recover the signal from the sequence of voltage pulses. The 
filter is designed to have negligible ripple in its attenuation characteristics 
in the speed-up signal range. Since the power spectrum of a signal does 
not depend on phase, the filtering error is also negligible. In measuring 
(he cross-spectral density of two signals, the filtering error is again negligi- 
ble since there is no change in the relative phase of the two signals. 

PARALLEL PCM on MAGNETIC DRUM.{ With a magnetic drum, it is 
possible to record the signal with one set of heads and to take out the 
speed-up signal simultaneously with another set of heads. The device 
works as follows: Each sampled signal point is converted into a binary 
number. The n digits are recorded on n parallel channels, with one chan- 
nel for each digit. Suppose that there are N digit positions on each 


{ It is easy to see why the frequency-versus-bias-voltage characteristics are 
extremely linear. When the tube is under conduction, its grid bias is approximately 
0, At the commuting instant, a voltage —Ea far beyond cutoff voltage —E, is 
impressed on its grid. Subsequently the grid voltage follows an exponentially decay- 
ing curve toward the bias voltage Hy: e, = Ey, — (Ha + Ey)e/®°, At e, = —E,, 
(he multivibrator commutes again, and the half period is 

5 = RC log pie = RC log (1 a ) 
where « = (Ey, + E.)/(Ea — E.). 
fore > 2, 
| Stull being developed at the time of writing. 


The function 1/log (1 +- 1/x) is extremely linear 


. 
| 


(i 
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channel. The recording heads skip N digits and record on every 
(N + 1)st digit, while the playback heads read out every digit. By 
this means the sampled signal is speeded up by a factor N + 1. Use of 
parallel recording of the binary digits allows the maximum number of 
sampling points for a given drum circumference. é' It also simplifies the 
coding and decoding circuits as compared with a time division system. 

The system has two advantages: For analyzing the recorded signal it 
does away with the manual process of forming tapes into loops, which is 
usually a bottleneck in large-scale data-analysis operation. For adaptive 
systems, the spectral densities of immediately past data can be analyzed 
while present data are being recorded simultaneously. 

2. Spectral Analysis Unit. The repetitive speed-up signal is hetero- 
dyned with a local oscillator of slowly varying frequency which scans 


2 

5 Bandpass Mixer output 
= Speed-up filter 

e servo signal response i 

3 

a 

on 


: f, 
SS 
Frequency 


Pony 


Fra. 7-3. Spectral densities of servo signal and mixer output signal in the electronic 
spectrum analyzer. 


over the required range. To obtain constant conversion gain the local 
oscillator signal is of square waveform and is applied to the cathode of & 
balanced linear amplifier (mixer in Fig. 7-2) so that the amplifier is cut off 
every other half cycle. The signal being analyzed is applied to the grid 
of the amplifier, and a conversion gain of 1/m times the amplifier gain is 
obtained.t| The output from the converter is filtered by a bandpass 
filter with center frequency f, and detected by a square-law detector t 
give the spectral density at f,: 


f= (hf) (7-19) 


where a is the speed-up ratio and f; is the local oscillator frequency 
The relation is illustrated in Fig. 7-3. 


} Consider an input A sin wt. The output is A sin w{1 + f@\Ka/2, where f()) 
a square wave of unity amplitude. Expressing f(l) in a Fourier series, we ob 
immediately in the output product a term (KaA/m) cos (a = wt. If wo < om» 
or > w», the only term that can pass the bandpass filter is the term with o: = @ © 
All the harmonic terms are filtered out. 
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The combination of bandpass filter and square-law detector serves an 
essential averaging function. The repetitive signal has a fundamental 
period 7',/a, and its power spectrum is a series of lines &f (= a/7T) apart. 
As discussed in Sec. 7-2, the power of any given spectral line of different 
sample records of the same ensemble is distributed over a wide range, with 
its standard deviation equal to its mean value. When one or a few sam- 
ple records are used in the analysis, it is necessary to average the power 
content over a number of spectral lines to obtain any meaningful result. 
There are usually a large number of spectral lines in the passband of the 
filter, and the power content of each spectral line is modified by the 
attenuation characteristics of the filter. As the square-law detector gives 
the total power of the filtered signal, its output voltage is proportional 
to the weighted average of the spectral lines in the passband. 

The rate of scanning must be slow enough so that a nearly steady-state 
condition exists. Another way of looking at this is that, during the 
time of scanning over each filter bandwidth, the sample signal has to be 
repeated a sufficient number of times so that different portions of the 
sample signal are equally emphasized. 

The attenuation characteristics of the filter determine the weights used 
in the averaging process. While it is necessary to average over a number 
of lines to reduce the statistical variation, the averaging process tends 
so to smooth out any sharp variation in the power spectrum. The 
latter effect is called blurring error. It has been shown that the optimum 
lilter characteristics to give a minimum of blurring error for the same 
slatistical error, or vice versa, are the semicircular characteristics} of 
Wig. 7-4a. The filter emphasizes the spectral lines near the center of the 
passband and deemphasizes the components near the cutoff. While very 
sharp cutoff in the attenuation band is desirable, this requirement is not 
loo critical. Figure 7-4b gives the spectrum of the input signal while 
lig. 7-4e gives the spectrum of the output of the filter. 

The above describes the analyzer function in obtaining the spectral 
(lensity of a single signal. Figures 7-5a and b illustrate a method for 
wnalyzing the cross-spectral density $1.(jw) of two signals 74; and 4. 
‘The arrangement for obtaining the real component of ©1:(jw), or cospec- 
(rum, is shown in Fig. 7-5a. The sum and difference signals 14(¢, + 72) 
und '9(t, — 72) are analyzed, and the difference between the two spectral . 
(lensities gives the cospectrum. In Fig. 7-5b, the signal 7. is first inte- 


| The semicircular filtering characteristics can be approximated by a single-tuned 
fmonant cireuit in cascade with a double-tuned critically coupled resonant circuit. 
The Q value of the latter is 1.414 times that of the former. Let x denote (f — fr) /A, 
where 4 is a bandwidth equal to the radius. Then |H(jw)|? = 1/(1 + 2? + x4 + 
Wot) St 1/(L + a*)(L + 24). The frequency response of the single-tuned reso- 
hwnt civeuit is 1/(L + 2), while that of the double-tuned cireuit is 1/(1 + #4). 
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| H (ju) |? of bandpass 
filter (ideal) 


Actual filter 
yr 


(a) A 
ue 
T, 
ns 
(b) Spectrum of repetitive signal 


(after mixing) 


(c) Output of bandpass filter 
Fic. 7-4. Fine structure of the spectra of input and output signals of the bandpass 
filter. 
2i,(t) 


Real [ ®,9(jw)] 


Im [ Pyp(jw)] 


(b) 


Fic. 7-5. Arrangements for measuring cospectrum and cross spectrum of two sign 


grated to obtain a 90° phase shift at all frequencies, and the subseque 
operations are the same as shown in Fig. 7-5a. The difference betw 
the two spectra is multiplied by w to compensate for the f requency depe 
ence of the gain of the integration process, and the product is the 6 
spectrum, ‘The methods can be readily proved by applying lq. (Sel 
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An alternative way of determining the cross spectrum is not to inte- 
grate or differentiate 72 but to heterodyne 7; and 72 with different square- 
wave signals which are shifted by one-quarter period. The heterodyned 
signals are then combined and analyzed. While the latter method 
requires slightly more electronics, it does away with the necessity of 
multiplying the results by w or 1/w. 

7-4. Computation of Correlation Functions by Digital and Analog 
Methods. The correlation function between two signals 7,(t) and 72(t) 
(or the same signal) is computed exactly the same way as it is defined, 
except that the sample signal has finite length 7): 


Smetana f f= 
bul) = AORTA = pe [none + nae (7-20) 


T, may be determined by Eq. (7-17) or some other considerations. 
Equation (7-20) indicates three essential steps, as follows: 


1. Time shift: either advancing 72 or delaying 7; by an interval r 
2. Multiplication 
3. Averaging 


The above steps may be accomplished by either analog or digital means. 
‘The digital method will be discussed first. Since only a finite number of 
data points can enter into the computation, we select an interval 7’, 
T = T,/M, and take readings of i; and 7, att = 0, T, 2T,... , nT, 
e These readings will be denoted as 7:(n) and i2(n), respectively. 
Similarly, we shall use ¢12(h) to denote ¢12(7) at 7 = hT, and h = 0, 1, 
2,3, .... Equation (7-20) can be written approximately as 


n=M-h 


$12(h) ai Woe Z1(n)t2(n =a h) (7-21) 


n=0 


The crucial points in this method of computing the correlation func- 
tion are the following: 

1. The choice of T. Too large an interval results in unnecessary work, 
while too small a T results in mixing of the spectral densities. Let w, be 
(ofined as 2r/T. It is easy to show that sin wt, sin (w. + w)t, and 

sin (@, — w)t give the same set of readings. The same is true for the 
vosine functions. Once T is selected, there is no way of distinguishing 
spectral components of frequencies w, + w from that of frequency w. 
‘The criterion for determining 7 is therefore 


te ee (7-22) 


Wh 
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where w, is the high-frequency limit of the spectrum in the sense that there 
is only negligible power content _ Rea beyond w,. It may be 

i inspecting the sample signal. 
pone om ee witch length 7;. The choice of 7; is determined 
by the nature of the signal and desired accuracy. Because of the poten 
ness of 7:(¢) and i2(t) and the finite length of the sample, there is a random 
variation of the computed ¢12(r) from its expected value. For instance, 
no matter how large or how small 7 is, the mean-square values of 71(t) 
and io(f¢+ 7) are $110) and ¢22(0), respectively, and the product 
ix(t)io(t + 7) may take any value between plus or minus a few —_ 
/ $11(0) $22(0). While this scattering of values can be —— vy 
averaging over a number of data points, as shown in Eq. (7-21), t ; nu 
ber of independent data points is limited and is proportional to T1. 
we choose the interval 7’ between data points according to Eq. (7-22), 
the result is not appreciably different from that obtained by integration, 
since every bit of independent data is utilized or nearly utilized. 

It is shown.in Appendix C that the scattering of measured data points 
varies slightly with the delay time 7. For Gaussian random signals, the 
mean-square error of the measured correlation function 1s given as 


1 C-) 
([@12(7) meas — 12(7)]*) = oe he [d11(71) b22(71) 


+ diolt + 11) b12(7 — 71) dri (7-28) 
where ¢$12(7) meas is defined as 


Tit 
12(T) meas = raf 41(E)t0(E + T) dt (7-24 


The unlabeled correlation functions are the theoretical or true values f 
the ensemble. Equation (7-23) is obviously a counterpart of Eq. (7-15) 
There is a significant contrast, however. Equation (7-15) shows tha 
the standard deviation of the measured spectral density at a given f 
quency is proportional to the spectral density at that frequency. i 
tion (7-23) shows that the standard deviution of the measured correlati 
function depends on the correlation functions at all values of delay a 

Equation (7-23) holds approximately : the measured correlation fu 

ion is computed according to Eq. (7-21). 

Hae ‘How se values of ¢12(h) need to be computed? At first a 
the answer appears to be trivial. We just calculate ¢12(h) for all a 
of h starting from 0, 1, 2, 3, . . . until we come to a value m “a q 
¢12(h) is vanishingly small for all values of habove m. However, t 4 
false answer. Because of the scattered nature of measured @i2(h), ib 
not possible to tell from one sample alone whether the measured diy 
truly reflects the ensemble correlation function or 1s merely a ran 
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variation. This indistinguishability is especially true at the tail end of 
the correlation function. 

The above discussion points to the flexibility in the selection of m. 
It is not difficult to see that the choice of m corresponds to the choice of 
resolution Af in spectral-density measurements. We recall that spectral 
density is simply the Fourier transform of the correlation function, and 
the well-known rules between frequency resolution and duration of a sig- 
nal apply. If we compute the spectral densities from the measured 
correlation function which occupies an interval |r| < mT’, the best resolu- 
tion we can obtain is approximately 


Af= ae (7-25) 


Hquation (7-25) will be substantiated by a more exact analysis. Since 
Kiq. (7-15) holds equally well whether the spectral densities are computed 
from correlation functions or directly from the sample signals, the 
expected per-unit error of computed spectral density is 


By i eh, 7 
«= Vt af R M (7-26) 
Mquation (7-26) shows that, in order to have some degree of statistical 
reliability in the computed spectral density, m should be no more than a 
small fraction of M. On the other hand, m has to be large enough to give 
the desired resolution. 

Kquations (7-22) to (7-24) are based on the assumption that we are 
limited by the available signal sample and are doing our best in estimat- 
ing the statistical properties from the available sample. Another alter- 
native is that the available signal sample is unlimited but we are limited 
hy the number of computing operations, because of cost or other con- 
siderations. In that case we would do better by pacing the delay time at 
intervals 7 as given in Eq. (7-22) but pacing the time ¢ of data points at 
tnuch larger intervals, so that each pair of data points is completely inde- 
pendent of any other. The data-sampling process is shown in Fig. 7-6. 


iy averaging N values of 7;()i2(t + 7), the mean-square deviation of 
measured ¢49(r) is 


1 
([$12(7) meas — 12(7)]?) = N {$11(0) p22(0) + [¢12(7)]?} (7-27) 
Nquation (7-27) is derived in Appendix C. Since the correlation func- 
lions usually extend over an interval much larger than T, Eq. (7-23) gives 
considerably higher value for the estimated mean-square deviation. A 


similar situation is found in mining if we draw the analogy between 
(he statistical information contained in a piece of signal sample and iron 
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contained in a mine. Equations (7-22) and (7-23) represent virtually 
complete depletion of the mine, whereas Eq. (7-27) represents mining 
only the choicest ore. The former gives maximum yield per piece of 
mine, while the latter gives maximum yield per unit of effort. 


1 Ne +. 3° 


(b) 


Fia. 7-6. Two ways of signal sam 
length of sample record is limite 


pling in computing correlation functions. (a) When 
d; (b) when number of computations is limited. 


Averaging | 7 7(t) 
filter 


h(t) 


Fra. 7-7. Block diagram of continuous correlator. 


While digital computation can be readily programmed on a generale 
purpose machine, it takes readout devices to convert the sample signal t@ 
a sequence of numbers. The computational process is also time-con= 
suming. As an alternative, there have been developed various types of 
electronic correlators which compute the correlation functions by analo, 
techniques. We shall describe two types: 

1. The Continuous Correlator. A block diagram of the continuo 
correlator is shown in Fig. 7-7. The output of the multiplier is 


a(t) = a(t — r)tat) (7° 
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and the output y(t) of the averaging filter is 
ula) = f° elt — Caw) at (7-29) 


We shall require that the area under the h(é) curve be unity and that the 
correlator be in operation for a sufficiently long time, so that A(é) has 
negligible value for t > #;. Then 


(y,(ts)) = if (x,(t1 — U’))A(t’) dt’ 
= f° du@)h@) dt! = du) (7-30) 


In the above derivation, the expected value of «x(t; — ?’) i 
Iq. (7-28). As ¢12(7) is independ- . aiden ie | 
ent of ¢’, it is taken out of the A(t) 
integral sign. The balance of the 
integral is simply the area under 
the h(é) curve and is required to be 
unity. Equation (7-30) shows that 
y(t) is an estimate of ¢12(7). 

In general, the averaging filter is 
4 low-pass filter and h(t) has the Oo Tima t 
or i 
Beat eae i i :s ps ae Fic. 7-8. Impulse response of averaging 

filter. 

(o take the rectangular form of 
vurve b. It represents integration from 0 to T; — r. At the end of the 
run, or 4, = T1, we have 


1 Ti=¢r 
WT) = pF i, 2(T, — t’) dt 


A substitution of the dependent variable t’/’ = T, — t! — 7 gives 


1 Ti-—r 
y(T 1) T= ff a(t” + 7) dt!” 


1 (UN ae y 
; Ee if ar(t'’)ia(t”” + 7) dt!” 


lquation (7-80) is reduced to Eq. (7-24). 

As far as the instruments are concerned, the delay may be accomplished 
iy a delay line or by staggering the playback head if the sample signal is 
twcorded on tape. The multiplication is done by matching two square- 
law devices, as illustrated in Fig. 7-9. Each square-law device is made 
lip of a large number of diode shapers, and its accuracy can be easily made 
{o approximate 0,2 per cent. 
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Fia. 7-9. Block diagram of analog multiplier. 


£(t)+n(t) 4 (Etn)? 


2. The Pulse-type Correlator. One of the earliest electronic correlators 
built at MIT utilizes pulse multiplication. A series of pulses of height 
ix(t:), i1(t2), . . . are passed through a varying gate of duration i2(t: + 7), 
io(te +7), ... , respectively. The area of the resulting train of pulses 
is proportional to 7;(¢:)é2(t1 + 7). By integrating over the area of the 
pulses and dividing the result by the number of pulses, the average valu 
of i:(t)i2(t + 7) is obtained. Obviously, the spacing of data points is 
necessity larger than the delay time 7. As discussed previously, t 
method works best for direct analysis of live signals or when there 
unlimited length of sample record available. 

7-5. Computation of Spectral Densities from Correlation Functions. 
Once the correlation functions are determined by either digital or ana 
computation, the spectral densities can be calculated as follows: T 
known values are ¢, which represent measured ¢12(r) at r = AT, A = 
+1, +2,..., +m. Suppose that we wish to compute the spect 
density at w;. From the equation 


Py0( jw) = 2 1 12(r) e797 dr 


and by making use of all available data, we arrive at the following appro 
mate form: 
h=m 
oT ¥ pe sere 


h=—m 


(7-3 


However, because of the discreteness and finite range of measured bial? 
the above sum may or may not be the most desirable representation 
@,2(jw:). We shall denote it by the noncommittal symbol M (w) 
examine its significance. By definition, the expected value of $n is 
be KS Pe 

(on) = b12(hT) = ro ‘i Py2(ju)ere? dw (7- 

+ With minor variations, the analysis of this section follows essentially that 
J. W. Tukey, ‘Measuring Noise Color,’’ lecture notes for distribution at a meet 
the New York Section of the IRE, November, 1951, and The Sampling ‘Thee 


Power Spectrum Estimates, Symposium on Applications of Autocorrelation A 
to Physical Problems, Woods Hole, Mass., June 13-14, 1949, 
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From Egs. (7-31) and (7-32), 


h=m 
iP eo 
Com =D) [Xs uljnete~orr a 


h=—m 
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(7-33) 


The integral converges absolutely at infinity, since for all physical cases 
the spectral densities of the signals are limited within a finite frequency 
range. In fact, this is the condition upon which T' is selected [Eq. 
(7-22)]. The summation of finite terms in h can be taken up first: 

h=m 
e—im(w—o;)T — ei(mt1)(o—w))T 
ra 


ef(@—w)hT = 
h=—m 
_ sin (2m + 1) — w1)T/2 
ain (@ — 0) T/2 (7-84) 
Note that (# — w1)7'/2 is very small and its sine function is very close to 
the variable itself. Equation (7-33) can be written as 


(M(w:)) S [", &(je)W(w — 1) dey 


eit Nh sin (2m + 1)(@ — w1)T/2 

(w — a1) pe (7-36) 
Wquation (7-35) shows that (M(w1)) is not ,2(jw) itself but is the weighted 
average of the neighboring spectral densities. This is desirable from the 
standpoint of reducing statistical error, if W(w — :) has the proper 
waveform. 
. A plot of W(w — w1) versus w is given in Fig. 7-10. We note that there 
ix a large number of minor lobes, and the nearest minor lobes have a peak 


(7-35) 


where 


Nia, 7-10. Equivalent averaging filter of the digital computation method. Broken 


furve corresponds to Kq. (7-35) with 7 = (2 1)7/2; soli 
Ba, (7-88) with 9 02k 1 = (2m + 1)7'/2; solid curve corresponds to 


System Monograph 2606, 1957.) 


(Reprinted from H. Press and J. W. Tukey, Bell Telephone 
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1. Power-density Spectrum ®1;(jw). The following symbols are defined: 


M,=M(kAw) k=0,1,2,...,m 
& =M.(kdo) k=0,1,2,.. 


value of approximately —0.22 times that of the major peak. This is 
embarrassing, especially when the power-density spectrum @11(jw) is 
being computed. In some cases there is a sharp change in ®1:(jw) from a 
very low value to a very large value, and the computed (jw) at the low 


ee 
point becomes negative. In computing cross-spectral densities the error 


STEP @ 
is not so obvious but it is still there. One method of remedying this ham 
situation is to average over a few values of M(w:): M, =4T © 4 vs fies) ue :) 
M,(o1) = (1 — 2n)M(@:) + nM (a1 + Aw) + nM (w1 — Aw) (7-37) deal het 
Qa 
where Aw = Qm+)T (7-38) 


d, = 0.54, + 0.23(Misi + M;,-1) a= iB 2, see y mM 


&; are the measured values of ®1;(jw) at w = k Aw. 


and 7 is a small fraction which will be determined later. In Eq. (7-35), 
W(w — w1) is replaced by 


Ww — wo) = 1 - 2n)W(w — o1) + Ww — a1 + Aw) 
+ 7W(w — w1 — Aw) (7-39) 


We note that, by selecting Aw as given in Eq. (7-38), the minor lobes 


2. The Cross-spectral Density 


A, + 7B, = M(k Aw) 
P.+ 9Q. = Milk Aw) 


of W(w — 1) are matched against the minor lobes of W(w — a: + Aw) shai $ 
of the opposite sign. The fraction 7 is selected so that as much cane onhk 
cellation is obtained as possible. The centers of the side lobes are at A, = 27 cE a > (gn + $-n) cos om i) 
@ — w1 = (n/2) Aw, where n = 3, 5, 7,9,...-- At these points, the h=1 
magnitude of the remaining lobes can be readily evaluated in terms " 
the major peak: B, = 27 s) (d-n — $n) sin et 
Ao 2m+ 1 
STEP b 


~I 


Wee oe . Ahh (erate | 


The lobe at n = 3 is merged with the major peak, and we would do w 
to minimize the lobes at n = 5andn = 7. Setting the terms inside t 
brackets to zero at n = 6, we obtain 


P;, = 0.54A;, + 0.23 (Any + Ax-1) 
Q: = 0.54.B; + 0.23 (Br41 + By_1) 
Po = 0.54Ao + 0.464, 

Qo = 0 


Py and Q; are measured values of the cospectrum and quadrature spec- 


gc) Heed (rum, respectively, at w = k Aw. 
Teh a | Considering Eq. (7-38) and 7, = nT, Eq. (7-16) can be rewritten in 
4 aed terms of n and m: 
1= 77 = 0. eee + m 
Nn = Nn (7-41) 


where N is the number of sample records used in the analysis, ” is the 
umber of data points in each sample record, and m is the number of 
points in the resulting spectrum. From Eq. (7-41), we see that, in order 
\o arrive at any degree of reliability, the number of points in the resulting 
spectrum must be many, many times fewer than the number of data points 


we have to start with. The square root of the ratio of the two gives the 
probable error, 


From the above discussion, it is also obvious that the half bandwid 
of the averaging filter is Aw. There would be an adequate coverage of 
spectral-density curve by selecting o, at Aw apart. 

To summarize the above, the procedure of computing the spe 
density from the correlation function is listed below: 
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Therefore, if ®,, is known, ®,,(jw) can be calculated for various speeds of 
the plane. 

A similar case is a one-dimensional air-turbulence velocity measure- 
ment for estimating the forces and moments exerted on a plane’s surface, 
as illustrated in Fig. 7-12. Since the speed of the airplane is considera- 
bly higher than the turbulence propagation speed of the surrounding air, 


7-6. Spectral Densities with Independent Variable Other Than Time. 
Sometimes the measured variable is a function of space coordinates 
rather than time. One interesting application relates to the smoothness 
of arunway. The elevation z of the runway surface is a function of the 
distance x from the starting point. To estimate the effect of surface 
roughness on a plane, the spectral density ®,,(jk,) is computed with kz 
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hu lia, 7-12. Measured power spectra of atmospheric turbulence. (Reprinted from 


', Press and J. W. Tukey, Bell Telephone System Monograph 2606, 1957.) 
Fig. 7-11. Measured power spectra of runway roughness. o = mean-square vi 
of surface-height variations. (Reprinted from H. Press and J. W. Tukey, Bell Ti 
phone System Monograph 2606, 1957.) 


the latter can be considered fixed in position. The forces and moments 
(hat it exerts on a wing surface can be obtained by integration of the sur- 
luce pressure. Because of the motion of the airplane, the forces and 
moments become functions of time in exactly the same way as shown in 
Was. (7-42) and (7-43). 

7-7. Multiple-dimensional Spectrum. Sometimes the random varia- 
hile in question is a function of more than one independent variable. For 
instance, ocean-wave height is a function of time and the z,y coordinates, 
wid gust velocity of atmospheric turbulence is a function of time and all 
(hroe space coordinates. A full representation requires a multiple-dimen- 
sional spectral analysis, 


in units of radians per meter instead of the usual radians per second, 
typical spectrum is shown in Fig. 7-11. 

For a plane running at a speed V, the effect is exactly the same ts if 
were stationary and a movement z(t) were applied to its landings 
surface. The frequency w of 2(¢) is related to k, as 


w= Vis @ 
,,(jw) = io € °) 
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One central fact about multiple-dimensional waves is that wave motions 
of the same frequency but different directions of propagation exist inde- 
pendently. They do not add up to a single motion toward a single direc- 
tion with some vectorially summed magnitude. The waves propagating 
in various directions are just as independent as waves of different fre- 
quencies in the one-dimensional case. To put it analytically, let » be a 
random function of the independent variables 2, %2, . . - , Tn Inside 
the region 0 < 4; < X,7=1,2,...,7,0 can be expressed as 

. ¥ AW,Na, « 


eo 
ee N2 Nn 


The Fourier amplitude A(Ni,No, . . 


y= LN plier ert Nazet+Nwte) (7-44) 


. ,V,) is complex and is conjugate 


to A(—Ni,—N2, . . . ,—-Nn). A set of frequencies f; are defined as 
Oy N; 
f= i ~ 3 (7-45) 


Corresponding to the frequency range of the one-dimensional case, we now 
have a frequency region R, which is defined by the following inequality: 
n- Pence = ie bel east ic ty be ah DU (7-46) 

where Af is a small frequency range but large enough so that X Af > 1, 
The spectral density at f: = & is simply the total power inside R divided 
by the volume of R, in the limit that Af approaches zero and X Af” 
approaches infinity. 
Similarly to the one-dimensional case, the power of each spectral line if 
independent and has a distribution as given by Eq. (7-13), and the 
expected value H(N1,Ns, . . . ,Nn) is 


E(Ni,N2, ... Ns) = $a Po jonjen oe 


where w; = 24N;/X. The per-unit error ¢ is given as 


1 
© ~ (number of spectral lines in 2) 


as (X Af)- (7-48) 

Figure 7-13 illustrates a two-dimensional ocean-wave spectrul 
obtained by the stereophotographic technique. The stereophotograp 
give a complete record of wave height at one single instant over a cert 
area of ocean surface. The two-dimensional autocorrelation function 
the ocean-wave height is calculated from these photographs, and the tw 
dimensional spectrum is then calculated from the autocorrelation ful 
tions by a method similar to that of Sec. 7-5. The power spectrum 
represented by a contour map with the wave numbers in two mutt 
perpendicular directions as coordinates, as marked at the edges of the 
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tour map. However, in duplication of the cartesian-coordinate repre- 
sentation, polar coordinates are also given by the arrow pointing north, 
the angular markings from the north direction, and the scale above the 
origin showing wavelengths in feet. 

One complication in applying the ocean-wave spectrum to calculating 
forces and moments exerted on a ship is that the wave speed and ship 
speed are of the same order of magnitude. In contrast to the case of a 
plane speeding across a runway, we can no longer regard the ocean wave 
as fixed in space. Its relative speed to the ship as well as its change of 
frequency due to ship motion (Doppler effect) must be considered. A 
mathematical representation which takes these effects into consideration 
is given by St. Dennis and Pierson. 

7-8. An Appraisal of Various Methods. In the above sections we have 
discussed the few basic types or ways of measuring power spectra and 
correlation functions. One subject that we have not discussed is the 
use of correlation techniques in determining transfer functions. This 
we shall leave until Chap. 11. There is a considerable amount of litera- 
ture on the subject of correlation-function measurements. However, 
as this usually is concerned with detailed improvements such as better 
ways of multiplying, channel multiplexing, etc., and does not represent 
new basic methods, we shall not go further into the subject. It may we 
be said that many of the problems facing earlier workers have alread 
been solved by recent progress in electronic multipliers, integrator, 
etc., and no longer exist today. 

The methods that we have discussed so far are all for stationary signal 
For nonstationary signals, ensemble average over a large number 
samples by digital computation appears to be the only method. Ho 
ever, this is too straightforward to warrant any discussion. 

For stationary signals, direct analog measurement of power spec 
appears to be the best method. The design techniques of earlier chapte 
use power spectra directly. Even with the time-domain synthesis tee 
nique, the resulting equations for optimum system functions are express 
in exactly the same form, which of course should be the case since the t 
methods are equivalent. While it is necessary to express the po 
spectrum as a rational function in order to use the methods developed 
Chaps. 2 and 4, this has no loss of generality, since within the statist 
error limits a measured spectrum can easily be represented as a rati 
function of jo. 

Bearing in mind that the final data we use in the synthesis proced 
are in the form of power spectra, we can make some interesting com 
sons between power-spectra measurements and correlation-fun 
measurements: 

1. Frequency Range. The frequency range for power-spectra me 
ments with the heterodyne methods iy determined by the central 
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ee fs i the oe filter. The presence of any Fourier com 
in the signal with a frequency above fy will i ; 
aliasing. With correlation-functi he ioe 
on- on measurements, it is n 
oe the correlation time in steps. Let 7’ denote the faneane ‘ 
o . Then the presence of any Fourier component in the signal 
, , requency above 1/2T cps will result in frequency aliasing 

: ee requency Resolution. The frequency resolution for direct nope 
3 po rraagemnedel is simply the bandwidth of the filter, while that 
ion-function measurements is approxi 

is the longest correlation time (Sec. 7-5). ee 
Re frequency-range-to-resolution ratio for direct power-spectr 
i ysis _ about 100: 1 with ordinary LC for the bandpass filter, To 
obtain the same ratio by the correlation technique, one would hive te 


divide the range of correlation time i 
rareige Mine dce ime into at least 100 steps. This has 


Another element is time. As mentioned previously, even with the rela- 


tees here ee tape version, the power-spectra analyzer of Sec. 7-3 
ang and plots on paper a 100-point spect ithi oak 
It is very difficult to exceed thi nano ee 
s by first making c i i 
- with a high-speed digital computer. gbiginenereniess ais 8 
Be oe and Duplex Measurements. One difficulty in the 
f of Wiener’s prediction technique is the i 
range of power-spectra data requi i apiece 
of quired. Owing to the i i 
ferentiation process (or the ee 
é use of a lead network) the high 
signal and noise are very much ex ue Yih ie bl 
aggerated. Even if the high-f 
power contents of the signal and noise ar igi Miisids atoe 
e negligibly low by ordi 
standards, they are still ca i i smedi Le 
pable of influencing the i 
make an appreciable difference i i : cuin te 
ea e in the ideal characteristics of i 
prediction filter. Of course the requi highrise 
a , equirement of spectra measurement 
ae . a wide frequency range with large variations in power level can al d 
arise from other application problems ee 
( . i . 
a “hi = ae br} above with the spectra analyzer described 
Sec. 7- use a prewhitening filter before the signal 
and to record the signals with two di Soe eee ee 
o different tape speeds, usuall i 
‘ . . at 
of 10:1, The prewhitening filter has a frequency teaponse Bh ¢ 


|H (Jo) |? & G5 


\ > ] i 
: = ee : oa brings the power level throughout the frequency 
) erest to the accurate range of the anal 
es eee yzer. The slow-s 
Bre recording covers @ low frequency range in which the power phic 
- hy unwhitened signal is concentrated. The high-speed tape 
cording extends the frequency range by the speed ratio, and its fre. 


quene ion i i 
| y resolution is worse by the same ratio, The latter is not a serious 


SS 
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defect since the asymptotic characteristics of the armani hea of 
i tant than its detailed stru . 
the spectrum are generally more impor ; pee 
imi i d in Sec. 7-2, the predomina 
7-10. Confidence Limits. As discusse n Se wen 
i lue of ®(jw) is the random sta 
source of error in a measured va “ a 
iati igible systematic e 
tions of the samples used. If we assume neg 
ak other sources, Eq. (7-16) gives the rms per-unit sick Pe 
frequency range Af can be replaced by the bandwidth of the bandpas 


filter H(jw): Lf: Brn fate ay 


[,° (G2mNI af 


In order to obtain any acceptable degree of accuracy, the oa 
number of independent frequency Pik ee N ie m aie Sls ps bs 7 
j i istr ? 
asured (jw) has approximate y norma 
at Sec. A-10 can be used to establish its confidence interval. The 
inequality |V.. — V| < ain Sec. A-10 becomes 


Af = 


Dineas =: =] = a 
® 
It can be rewritten as ' 
I nn cc Rg (7-49) 


ey PACACIN) Fevers these ime 
The ratios 1/(1 — a) and 1/(1 + a) are called upper and lower confiden 
limits, respectively. o 
Example 7-1. The speed-up signal has a length of 0.5 sec, the bandpass : ‘-— 
handwidth of 80 cps, and the power spectra of five independent preg ee a 
hae identical circumstances are averaged to improve the reliability of the 


spectrum. Determine the 90 per cent confidence limits. 
Solution. The rms per-unit error of each record is 


ieee id 
BT tte Ne 


For p = 0.9, Table A-1 gives kp = 1.163. Therefore 


1 
= 1.163 x —_ x V4 = 0.1163 
a J eal Vi0 
= a 
a= GG 


There is a probability of 90 per cent that the actual (jw) lies between 0.895 and 1, 
times the measured value. - 

7-11. Summary. In this chapter, analog and sieve ae 
measuring spectral densities and correlation functions pa: oe 
cussed, Their comparative merits and the reliabilities of sue 
ments are also analyzed, 
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In making a Fourier analysis of a sample record of a Gaussian random 
signal, we find that the distributions of Fourier amplitudes are quite 
similar to the final distances arrived at in a random-walk problem. 
Mathematical analysis shows that, while the mean value of each Fourier 
amplitude is zero, its mean-square value is not, but varies at random 
somewhere between 0 and a few times 4;;(jw) /T;. This random variation 
gives rise to the uncertainties in the measured spectrum. The uncer- 
tainty, or expected error, is inversely proportional to the square root of 
the frequency resolution of the measured or estimated spectrum. Its 
reduction to the irreducible minimum is one of the major considerations 
in spectrum measuring or estimating techniques. 

To make a Fourier analysis of a sample record for every frequency 


is too time-consuming, and consequently two practical methods are 
developed: 


1. Direct analog computation of the spectrum, in which the speed-up 
signal is heterodyned, filtered, and detected by a square-law detector 
2. Computation from measured or calculated correlation functions 


When the second method is used, the frequency resolution of the spectrum 
is automatically determined by the longest correlation time used in the 
computations. This also gives some insight into the required resolution 
in spectrum estimates: If ,:(r) is expected to be zero for |r| > 71, there 
is no point in computing ®,(jw) for a frequency resolution better than 
1/7. 

Sometimes the independent variable is not time but a space variable z, 
or there may be more than one independent variable. Spectral densities 
in terms of wave numbers of these space coordinates are defined. The 
methods for estimating these “spectral densities” and the associated 
\incertainties from a given sample record are the same as before. 

The frequency range of the measured power spectra is limited either 
hy the center frequency of the bandpass filter or by the increment in cor- 
relation time. The frequency resolution is determined by the bandwidth 
of the bandpass filter or the longest correlation time. All considered, 
(lirect’ analog measurement of power spectra is the simpler and faster 
method, 

Prewhitening can be used to reduce the instrument error at frequencies 
where the spectral density is low originally. Duplex measurements with 
tocordings at two different speeds can be used to extend the frequency 
range. When used together, the two methods give accurate measure- 
Mmoents over a wide frequency range. 


A method for determining the confidence limits of measured spectral 
onsities is also discussed 


CHAPTER 8 


EFFECTS OF INACCURACIES OF SYSTEM COMPONENTS 


8-1. Scope and Purpose of Error Analysis. One problem facing @ 
system designer is that the actual system and the system he analyzed or 
specified cannot be expected to be identical. There is always the problem 
of the stability of a component over periods of operation, because of 
environmental factors such as temperature, shock, radiation, and atmos- 
pherical conditions. There are always the manufacturing tolerances, 
The dynamics of the controlled system also varies from time to time, 
because of loading or some other operating situation. Last but not least 
are the errors due to idealization of transfer functions of system com- 
ponents as well as the controlled system, for instance, the disregard or 
combination of small time constants or the approximation of distributed 
parameters by lumped parameters. These approximations are made 
in order not to complicate the design work unnecessarily. 

Therefore what he analyzed is an averaged and idealized system 
which we shall refer as the nominal system. At best, it is a close appro 
mation of the actual system. The pertinent question is: Could the act 
system with all its possible component variations still be expected to m 
given specifications? or, alternatively. What constitutes the least deman 
on the stability and tolerances of system components to do a given job 

The job requirements usually vary. However, we are in a bet 
position to answer these questions if we know something about er 
analysis. The subjects treated are as follows: 


1. Sensitivity of transient performance to component inaccuracies 

2. Sensitivity of closed-loop performance parameters (especially pol 
and zeros) to component inaccuracies 

3. Increase in mean-square error or some other cost function due 
component inaccuracies 


One assumption of the present analysis is that the inaccuracies are 
too high so that first-order variation can be considered adequate at 
estimate of the deterioration in performance. Otherwise, direct calew 
tions or one or the other of the self-optimizing schemes discussed in © 
10 should be used, 
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8-2. Calculation of Sensitivity Function of the Transient Response of 
a Linear System with Constant Coefficients by Transform Method 
The sensitivity function is defined as follows: Let x(t,a) represent tiie 
response of a certain system with parameter error a to a given input y(t) 
It is implied that x(t,0) represents the response of the nominal system, 
lor small values of a, x(t,~) can be written in a Taylor series: 


x(a) = 2(t,0) + [| at Ew Cees Bl) 


The important term is the linear term, and its coefficient is called the 
sensitivity function. For a linear system with constant coefficients, the 
system in question can be represented by a transfer function F(s a) 

which links the input y(¢) to the variable under consideration x(é). We 


have - 
a(ta) = £'{F(s,a) Y(s)} (8-2) 
The differentiability of Eq. (8-2) is assumed at present. Differentiating 


both sides with respect to a for n times and setting a to zero in the result- 
ing expression, 


d"x (t,o) OF "(s,a 
da” ned es ae [52 vo oe re it 2, +) + te 


Noting that F'(s,0)Y(s) = X(s,0), Eq. (8-3) for n = 1 can also be written 


as sli) age | [2m eee] x(s0)| 


‘The sensitivity f i i 
ay y function can be calculated from either Eq. (8-3) or Eq. 


(8-3) 


(8-4) 


Example 8-1. A simple RC network is subjected to a unit step input. 


Beasaple ‘ Determine 
(he sensitivity function to an error in the time constant. 


The transfer function of the 


network is 
Fis 3 ee 
a i+ (@ +a)s (8-5) 
Solution 
[=e] 2 § 
da Jao  (1+T7s)? 
rom Iq. (8-3), 
al 1 t 
-— — icaot | Ee —_ _ 
[ da Jano & { (ips To} es aa® ui (8-6) 
‘To check our result, direct solution from Kq. (8-5) gives 
alta) = 1 — eWt(T+a) 
Difforentiati | epressi ny es ‘ 
a ety ing the above expression with respect to a and setting a to zero give 


ee 
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Example 8-2. Suppose that, in Example 8-1, there is a parasitic inductance L in the 
RC network. Let \denote LC. The transfer function of the network can be written 


as 
1 
Foe) = 55 F Te +1 ea 


Determine the sensitivity of the step-response function to L. 
Solution. Equation (8-3) gives 


Bo ae {- 8? \ BAY (5 eit — er) 
OX Jr=0 (1+ Ts)? s T?\T 


Direct calculation from Eq. (8-7) gives 


ates oe —2t/T (+a) 3(; = ) —2t/T (1a) 
a(t,A) = 1 (2 +1)¢ +5 2 Liye 


(8-8) 


(8-9) 


where a = /1 — 4A/T?. For very small values of 2, 


x(tr) =1— (1 + 7) (1 — M)ewr + eum 
~1- eT + ‘ (j eT — ent + om) (8-10) 


We note that Eq. (8-10) agrees with Eq. (8-8) for all positive, nonvanishing values of t, 
However, at t = 0, Eq. (8-8) gives the value of the derivative as —1/T?, while Wqy 
(8-10) gives 0. The discrepancy at { = 0 arises out of some improper use of cale 
Tf we write the inverse transform of Eq. (8-8), 


[oat 
Or A=0 


culus. 
hy pie 


iota SAL seo st 
Qnj J —j~ @ + Ts)?° - 


we note that the integral is not convergent for t = 0. Therefore, the interchange 
differentiation and integration in deriving Eq. (8-3) from Eq. (8-2) is not legitima 


In practice, the problem of interest is the sensitivity function f 
t>0. At t= 0, the solution is no more than the initial conditio 
which we already know. However, one may ask if Eq. (8-3) is alwa, 
correct for t > 0. This point will be examined in the next section. 

The result may be mentioned here, however. If the parame 
errors or inaccuracies in system simulation are of such a nature that t 
do not change the order of s in the denominator polynomial of I’ 
Eq. (8-3) is always justified. If they increase the order of F(s), t 
following situations would arise: 

1. The inaccuracies do not cause high-frequency parasitic oscillation 
either permanent or transient; Eq. (8-3) is rigorously correct. 

9. The inaccuracies cause high-frequency parasitic oscillations of 
transient nature only; Hq. (8-3) neglects such transient oscillations & 
gives the error in the measurable averaged response. Since the paras 
oscillations are usually very fast and are not detectable by ordi 
instruments, we should expect Eq. (8-8) to agree with the measure 
It is justified physically though not mathematically, 
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a The inaccuracies cause sustained high-frequency parasitic oscilla- 
tions. Since these oscillations almost always cause secondary effects b 
saturating the system components, redesign of the system is called for ‘ 

In all, we may say that Eq. (8-3) is good for error analysis of eyatetnd 
without sustained parasitic oscillations. The issue of possible parasiti 
oscillations has to be examined separately. ; . 
8-3. Convergence Problem of the Transform Method,:>0. Fora 
linear system with constant coefficients, the transform funotion F(s,a) 
is a rational function in s. Consequently, we may write 


ae No(s) + Ni(s,a) 
#6) Do(s) + Di(s,a) 


ee N eee OR D,(s), and D,(s,a) are polynomials in s. The 
polynomials Dj,(s,az) and Ni(s,a) are assumed to h i 
inl ; o have the following 
1. As a approaches zero, all the coefficients of 
approach zero. er RAS eta 
2. All the coefficients of Ni(s,a) and D,(s,a ic wi 
. ja) are analyt 
to a in the vicinity of a = 0. sage re ieee 
Let N 1 (s,a) and D,(s,a) denote the partial derivatives of Ni(s,a) 
and Dyx(s,a) with respect to a. Ny (s,a) and D,(s,a) are two pulys 
nomials in s of the same orders as N,(s,a) and D,(s,a), respectively. 
Condition 2 implies that all the coefficients of Ni (s,v) and D,(s,a) 
are continuous at a = 0. We assume further th i nies 
scabs er that Y(s) is a rational 
For t > 0, the inverse transform can be written as 


(8-11) 


1 
x(tja) = ay [ Foo) Y(s)e* ds (8-12) 


where C is a contour enclosing all the poles of F(s,a) Y(s) 
of F(s,a) is 


The derivative 


OF (8,0) od [Do(s) + Di(s,a)]Ni™(s,a) — [No(s) + Ni(s,a)]Di™ (s,c) 
Oa [Do(s) + Di(s,a)]? 


(8-13) 


If Do(s) is a polynomial of equal or higher order than D,(s,a), obviousl 

the highest-order term of Dy + D, cannot be zero in the vend Le ¢: 
If the order of Do(s) is lower, the highest-order term in Dy + D, is the 
highest-order term of D;(s,a), the coefficient of which is, from our oneiaine 
lion, equal to zero at a = 0 and is analytic in the wiebuidiy of a=0 
Unless all the derivatives of the coefficient with respect to a vanish (andl 
vonsequently the coefficient is identically zero in the vicinity of a = 0 

contradicting the assumption that it is the highest-order term), there is ‘ 


—————— Te 
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vicinity around a = Oin which the coefficient is not zero except at a = 0. 
Let us consider a being any point in this vicinity but not at a = 0. 
Since the coefficient of the highest-order term in Dy + Dy; is not zero, it 
follows that all the roots of Do + D, are finite, and the contour of inte- 
gration required to enclose all the roots is of finite length. From Eq. 
(8-13) we see that the value of (0/da)F(s,) Y(s) is bounded everywhere 


on the contour. Therefore 


1 0 ; spas st 
i i 2 F(a) ¥ se" 8 = 355 5a Flea) ¥ (ser ds 
0 
= 5. x(t) (8-14) 


The above argument can be repeated to obtain 


a” Pte Me ae a9 a . 
aan ttre) aoe Lae [F(s,a) ¥(s)]e" ds (8-15) 
From Eq. (8-15) we can write 


o” ‘ i 
Fe v(t) | = bm oj 


The problem now is whether the limiting process and integration sign 
can be interchanged. With reference to Eq. (8-13), the contour integral 
of Eq. (8-15) gives a number of exponential terms with polynomials of ¢ 
as coefficients. If D(s,«) is of the same or lower order than Dy(s), the 
number of exponential terms is conserved no matter which way we do it, 
Equation (8-13) gives a number of poles and associated residues af 
functions of a. If we integrate first, it means that we write the exponen 
tial forms and then let @ in the time constants and coefficients appromt 
zero. If we take the limit first, it means that we let a in the time con 
stants and coefficients approach zero first and then write the exponenti 
terms. There is no difference. 
A problem arises, however, if D,(s,e) is of higher order in s than Do(s) 
By taking the limit first, we are throwing away all the exponential ter 
not arising out of the roots of Do(s). The question is: Can these expone 
tial terms be thrown away? 
To answer this question, let us examine the roots of Do(s) + Dy (sa) 
Let n and n + m denote the orders of sin Do(s) and D,(s,@), respectivel 
Proposition 1. As a approaches zero, n rools of Do + Di approach 
roots of Dy and m roots of Do + Di approach infinity. 
The first statement is easy to prove, since D,(s,a) approaches Zero 
a approaches zero, ‘To prove the second statement, if there is any fi 


| & F(s,a)¥(s)e" ds (8-16) 
c 0a” 
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root s; other than the roots of Do(s), then 
F Dy(s1) + D,(s1,a) = (0 (8-17) 
ie ten coefficients of D,(si,) approach zero and s; is finite, D,(s,,@) 
“ ne zero. ‘ inet hue = 0, which contradicts the abstitee 
1 is not a root of Do. Similarly, by differentiati 
ot A ating D D 
“ with respect to s, it is shown that the multiplicity of ste of ae 
cannot be increased. Since the m extra roots must go somewhere, thi 
proves our proposition. sae 
The m roots which a infini 
pproach infinit i 
referred to as the m roots. TAR APNE At 
i ris Ay said 2. As a approaches zero, the magnitude of the m roots as 
: pie , yey ig apart components approaches powers or fractional 
a, and the residues of the m root } } 
faster than a finite power of 1/ca. 1 ea 
2 payee coefficients of Di(s,a) and Ni(s,a) are analytic in the vicinity 
when dpe ire can be approximated by the first nonvanishing 
its Taylor’s series expansi i i i 
Piles hoe pansion, and the result mentioned in proposi- 
Now we are in a position to examin igni 
e the signific i 
order of integration and limiting process: : weet 2 
1. If all the real components of th 
e m roots approach ive infini 
as a approaches zero, for any finite t > 0 oi hig in 


‘ 1\¢ 
lim {—) e-(/a’t = 


pase how large the constant a or how small the constant b as long 
; . ; _The neglected exponential terms vanish quickly, and the inter- 
( ane of integration and limiting is justified. 
aa . some of the m roots have constant negative real components and 
om wie positive or zero real components, as a approaches zero the 
* an vie constant negative real components introduce transient oscil- 
: . ms 0 very high frequency. We refer to such a system as having 
ores parasitic oscillations. The designer may consider such oscil 
eos as objectionable and redesign the system. Alternatively, the 
‘ ei may tolerate the transient parasitic oscillations as they ube too 
on ‘eg affect any measurable results. Our interchange of integration 
a | meshed ii the parasitic oscillations and gives the sensitivity 
‘tion for the measurabl i 1 i 
‘ath e part of x(t,e) only. It is exactly the desired 
3, f some of the m Toots have zero or positive real components, sus- 
ined parasitic oscillations would result. A redesign of the syst 
he necessary. pee rt 
lhus we come to the conclusion at the end of Sec, 8-2. 
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Example 8-3. The Do(s) of a system is 1 + 2¢s + s%, and D,(s,a) may take any of 
the following forms: 


. a(s? + s’) 
. a(s? + 84) 
rea? = a8* 
5 


wn e 


as 


Determine the nature of parasitic oscillations, if any. 

Solution. Since we are interested only in the nature of the roots in the asymptotic 
limit of very large s and very small a, only the highest-order term in D,(s) needs to be 
retained. ‘The other terms in Do(s) are negligible. In D(s), only the terms of higher 
order than Do(s) need to be retained. Therefore, as a approaches 0, and s approaches 
infinity : 


Approximate form of Do + dD Asymptotic roots 


1. s? + as? -* 
a 
2. s? + as? + as* = +ivi 
3. s? + as? + a’s4 keiale 
2a 
eee -a-% are an 


Cases 1 and 3 do not have parasitic oscillations. Case 2 has transient parasiti¢ 
oscillation. Case 4 has sustained parasitic oscillation. 


8-4. Sensitivity Function of Transient Response of Nonlinear Systems 
and Linear Systems with Time-varying Coefficients. For linear systema, 
we may calculate directly and obtain an analytical expression of x(t,a), 
The sensitivity function dx(t,«)/da is calculated from x(t,«) by differentiate 
tion. While this procedure would be intractable for all but the simplest 
systems (for instance, all four cases of Example 8-3 are too complicated 
for direct calculation), it nevertheless offers an alternative to the transform 
method. 

For nonlinear and time-varying systems, the equations are usually 
solved by numerical integration or simulation on an analog computer. We 
we make two runs, one with a equal to zero and one with a equal to so 
small finite value, and take the difference, it is obvious that the errors iff 
the computations must be much less than the errors introduced by a, if 
the sensitivity function so obtained is to mean anything at all. To pub it 
another way, the accuracy of the measured sensitivity function is fw 
from as good as that of the computations. 

Miller and Murray developed a method which makes possible a dire 
analog run of the sensitivity functions. Consequently, the sensitivi 
functions so obtained are as accurate as the computation process, 7 
work has been intended for analyzing the errors in the output of an ant 
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computer due to errors in simulation. 


( The sour i 
into the following categories: mabictindersa ives <7 


1. @ errors: inaccuracies in s i ; 
: ystem simulat 
order of the system ation that do not affect the 


2. 8 errors: external disturbances 


3. A errors: inaccuracies i i i 
RR icles n system simulation that affect the order of 


In terms of Miller and Murray’s classification, component inace i 
due to manufacturing tolerances and changing arvironnvontel ab er) 
sources for @ errors. Idealization of the transfer function gives vis GH 
errors. The # errors are what we refer to as noise and load dist ny: 
a the latter effects are considered inputs in our analysis and fied ye 
ie gia ay in previous chapters, we are concerned here with only the a and 


The differential equations describi i i 
tet ibing the theoretical or specified system 


F (41,42, See Fee a wary He 


be serio it] 4 = Ny 25 By hss jon (Sans) 
3 n simultaneous equations are perfectly general and may represent 
either linear, nonlinear, or time-varying systems, depending on the form 


Fy = && — Tey Ie MAD. Sys a 


Then in the remaining n — i i 
the gn — m equations, 2x;,41 simply repres 
time derivative of x1, for all values of k fioin 1 to me iar os 


Let us consider the a errors first. The actual system can be repre- 


sented as 
lite, .. , nC key» > «, Laity » ». al = 0 
where #=1,2,...,” (8-19) 
BN Ai Sais sages s: pet) 70 D 
SH Piha, Wi Ghat ites. oe Scat) (S220) 


a solution of Eq. (8-19) gives 2; as a function of ¢ and the a’s: 
ne: - + + Om). Let x,(t,a) and 2,(t,0) be the abbreviated forms of 
a 4 mm o ae and rey ... ,0), respectively. Because of 
iq. (8-20), 2:(4,0) is a solution o Eq. (8-18). For small val 

08, Ui(tay,a2, . . . ,a@m) can be written as ne Ae 


k=m 
wi(tya) = x,(t,0) + > ES OK (8-21) 
rey aon 


——— OO  “~-— 
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i i ] zero. 
The partial derivatives are evaluated by letting all the " ' — i % 
Let w(t) denote these derivatives. Equation (8-21) can 


k=m 
a(t) = ai(t,0) + », wir(t) or (8-22) 


‘ i 
The functions w(t) are generalized forms of transient-response sens 
From Eq. (8-19), 


Yael) 4 =o 


Ov; Oa, Ook 


tivity functions. 


y & dat; (t,x) ee 
“On; OAK 
pine ee eee 


i a) 
The above equation holds for all values of a’s. In particular, for zer 


values of the a’s, 


aG; _ OF; AG: _ AF: 

a (8-24) 
aa;(t,or) = 8 dx;(t,a) = Wir 

dar Ot aK 


Substituting Eq. (8-24) in Eq. (8-23), we obtain 


oF a oF a 26) = 
j 


i d to 

The functions 2:(t,0), which are solutions up tie Mai ae ae s 
itivi i w., can be so 

be Bae is eh es us now examine the nature 


1, 2, 


‘= r” (82a 
0 k= a 2, m 


eee wae 


be known. 
(8-25) without computing 2:(t,«). 
tions: | 
oo. tia Equations (8-18) are linear with constant meee oa 
artial derivatives aF;,/d%; and dF ;/dx; are ee : bh ed 
ao the same linear equations with constant coefficients. 

i re in the inhomogeneous terms. ; a 
iti ‘a pheno (8-18) are linear with time-varying pa 
Equations (8-25) are linear with the same time-varying coelle 

i inhomogeneous terms. as ; 
ping sama (8-18) are nonlinear. Since oF :/ dx; and be 
Hehlund me the 2;(t,0)’s, Eqs. (8-25) are linear equations with time 

t\%y 
i % 5 . . 
te nea values of wj, are known from an inspection = the Oa 
and Eqs. (8-25) allow the subsequent values of wjx to be dete 
valculation or simulation. oe 

Saale and Murray’s work, the d errors are oe - phi . 

involved way than the @ errors. We shall show how the A 
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treated in exactly the same way as the a@ errors, except for the initial 
conditions, and then we shall show how the proper initial conditions (at 
t = « > 0, not t = 0) can be obtained. The method is given as follows: 
Suppose that, because of the \ errors, the actual system equations (8-21) 
depend on #1, #2, ... ,@,. We may rewrite Eq. (8-19) as 


F (41,22, ee ae €an,01,09, . 


peniber tees) e=1,2,...,2n (8-26) 
so that for 1 <i <n, andn+1<k < 2n, F; is independent of all 2, 
and z. Forn +1 <i < 2n, F; is given as 


F; = fn — % a=n+1,n+2,...,2n 


Under these conditions, Eqs. (8-26) are the same as Eqs. (8-18). 
other hand, the functions Gi(a1,2, . . . ,ton,®1,02, . . . ,2onj01,02, . . .) 
may depend on én41, @n42, . . . , £2n When not all the a’s are zero. The 
net result is that the second derivatives #1, #2, . . . , #, are introduced 
into Eq. (8-1) while the formalism of a errors is still followed. The above 
procedure can be extended readily to higher derivatives. 

From the above discussion, we see that the errors can be written and 
treated as a errors. A general analysis of the a errors is sufficient for 
treating both the a and ) errors. However, there is a point of reserva- 
tion. Att = 0, not all the w’s are defined. For instance, let us consider 
the problem of Example 8-2. The differential equation of x(t,A) is 


Az + Te +z = u(t) 


where u(t) is the unit step function. 
method, Eq. (8-28) is written as 


(8-27) 
On the 


(8-28) 


According to the above outlined 


F, = G2. = 4, — 2, = 0 


We note that, with 4 = 0, 2 = 1/T att = 0. With A + 0, no matter 
how small, z2 = 0 att =0. The function 


w(t) = (3) 


cannot exist at = 0. Consequently, Eq. (8-25) is not valid at ¢ = 0. 
‘This is also true quite frequently in the general case. While Eq. (8-25) 
in still valid for ¢ > 0, its failure to hold at ¢ = 0 makes the initial con- 
(litions at £ = 0 useless, and they must be replaced by another set of con- 
(litions, initial or otherwise. One readily available set can be obtained by 
(he transform method for ¢ = ¢, where ¢ is an arbitrarily small positive 
constant, Since Hq. (8-25) can be extended to as low a value of t as we 
Wish, the initial conditions at ¢ = ¢ are adequate, 
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he method is as follows: “ne r ' 
; dashienoatec the system equations at the vicinity of t oF 0 “6 aa 
eceesions in the 2’s. Approximate the inputs by functions tha 
i i hss 
formable to rational functions ° | 
Te etc out the transform equations and solve for the X;(s,«)’s. 


3. Calculate Wixe(s) as 


aX;(s,a) 
van = PL 
These are rational functions in s. Wi(s) may be written as 
N(s 
Wald) = Bey + 


where N(s), D(s), and Q(s) are polynomials in s, N (s) ‘a ei ae a 
be 0, 1, or any poly ? 

lower than D(s), and Q(s) may , A, or: a 

ie N, 1 D is called the principal part of Wix(s) and is denoted as ae a 

We note that, as Q(s) does not contribute to wix(¢) for any ¢ , Eq. 

(8-3) gives hi ede di 

However, it is not necessary to evaluate the inverse transform. The 

initial-value theorem gives 


wx(0+) = lim sP[Wix(s)] (8-30) 


Equation (8-30) is to be used as the initial conditions. 


Example 8-4. As an example to illustrate the above, consider the linear equation 
(8-28). Equations (8-25) and (8-29) give 


wi + wi + a2 = 0 


Solving the equation T#: + %1 = u(t), we obtain 


Equation (8-31) gives 


w(t) — meet + AetT 


where A is an arbitrary constant. To determine A, Eq. (8-28) gives 


ti =e 
Wis) = Gp Tes G+ Ts)? 


Equations (8-30) and (8-32) give : 
wi(0t) = wind 
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Substituting Eq. (8-34) in (8-32), we obtain A = —1 /T?, and 


wilt) = . (5 eT er) (8-35) 


which agrees with Eq. (8-8). On the other hand, it is obvious that x(0,) is zero no 
matter what d is. 


(0) = FO” = 0 (8-36) 


If we use Eq. (8-36) as the initial condition, it follows that A = 0, and the result is 
erroneous. 


8-5. Sensitivities of Closed-loop Poles of a Linear System. When we 
talk about the performance parameters of a closed-loop system, we may 
be referring to two different types of parameters: 


1. Parameters that govern the general characteristic of the response 
curve, e.g., natural frequency and damping 


2. Response to a particular input at a particular time or frequency, 
e.g., peak overshoot or maximum modulus 


The first group will be discussed here while the second group will be 
discussed in See. 8-6. 

The natural frequency and damping are well defined for a second-order 
system but are not so well defined for a system of third order or higher. 
In general, we may say that the terms refer to the locations of the pair of 
control poles in a closed-loop system. By giving an explicit expression for 
the position shifts of these poles, the problem is solved. 

Let p; denote the poles of (C/R)(s), and let a; denote the variable 
parameters of system components, where 7 and j are running indices. 
lor small variations in the a’s, the variations in the p’s can be expressed 


1s 
Pie. as Fm 
Op; Paes > da; 6a; (8-37) 


3 


The partial derivatives dp;/de; are evaluated for the ideal condition 
ba; = 0 for all 7’s and are defined as pole sensitivities to the parameter a;. 
(The definition of pole sensitivities is not unanimous in the literature. 
llowever, as one can be easily obtained from another, the exact definition 
is not significant.) The closed-loop function can be written as 


K |] (s — z) 
= (s) = —- k is 
Z ll (s = pi) 


(8-38) 
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Fra. 8-1. Block diagram illustrating a prototype system with both series and shunt 
compensation. 


Since multiple roots in (C/R)(s) are rather unusual, especially for the con- 
trol poles, we assume that the poles and zeros are distinct. With refer- 
ence to Fig. 8-1, (C/R)(s) can be expressed in terms of the transfer func- 
tions of the components as 


=1+4+ H(s) + @(s) (8-39) 


Ppt took 
(C/R)(8) 


Taking the logarithm of the above equations, we obtain 


>, log (s — pi) — ¥ log (s — %) — log K 
i k 
= log [1 + H(s) + @-“(s)] (8-40) 


Differentiating Eq. (8-40) with respect to , multiplying the result b; 
—(s — p,), and setting s equal to pi give 
C 
90; ti [« — pi) R |_| | 
The expression in the first brackets is the residue of (C/R)(s) at 8s = 
Denoting it by Ri, Eq. (8-41) becomes 
| ( 


ae 2a 
In series-compensated systems, H (s) = 0, and Eq. (8-42) can be writ 
is a pole of (C/R)(s), 


0a; 
in a different form. We note that, since p; 


0H (pi) os 


0a; 


a | ae 


dG—1(pi) 
0a; 


dG—!(pi) 


0a; 


“J 


G(p) = —1 
Equation (8-42) becomes 
Opi + R; aG-! (pi) s : 0 Pere oe 
da; G (pi) da; - ea log G-'(pi) ( 


For a series-compensated system, G(s) usually represents a product of 
form G4(s)G2(s)@3(s) °° * Equation (8-43) allows the irrelevant 
drop out of the computation. 


EFFECTS OF INACCURACIES OF SYSTEM COMPONENTS 205 


Example 8-5. The open-loop ti i 
Pri orton p transfer function of a lead-lag compensated system 
— 20(1 + 0.5s)(1_+ 2s) K 

(1 + 0.1s)(1 + 10s) @ + Tis) + Tos) 


The nominal values of K, 71, and T 
vba I, are 5, 0.5, and 2 sec, respectivel D i 
Te . Det 
the natural frequency and damping of the closed-loop system as ab thei ee 
tivities to variations in K, 71, and T». sealer 
Solution. Under the nominal condition, 


G 
ie (8-44) 


fo LE. 
©) (1 + 0.1s)(1 + 10s) 

c oe 100 

Rk s? + 10.1s + 101 


Therefore, 
wo = V101 = 10.05 
= Ton: 
= 0.5025 = cos 59.8° 


~ 10.05 X 2 
10.05 /120.2° =p. = 10.05 /—120.2° 


Pr 
Let us consider the pole p;. Its residue is 


100 


1= i co ie = —J5.75 

From Eqs. (8-43) and (8-44), 
Op _ 1 ID7B 
se ki (- z) = PS = ji.i5 
Op, _ R Pi - —7j5.75 es rs 
aT Titan Oe - ame LBZ 
ap. _ 95.75 . 
ats 1.08 — 70.086 ™ 7-95 [—87.5° 


|6P,| sin (@-¢) 


|4P, | cos (6-¢) 


Vio, 82. Vector diagram illustrating relationship between the vector 6p, and the scalors 


fw nnd by, 


———— Te 
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206 in deuce Let F and F represent (C/R)(jwm) and (C/R)(— Jom), respectively. 
From Fig. 8-2, Equation (8-48) can be written as 
duo = |Bp.| cos (8 — 4) OM _ 1 8 pp  _ MO pet 
: d cos 6 = sin 6 66 = — Lepsl min @ vie (2 = #1) da; 2M da; uf) 2 da; pes 
us ; e 
= —WM X real component of |? (2 + Ee | (8-50) 
Therefore, P i ) (8-45) 0a; 0a; 
wo _ | OP1 ‘i. : 
rae | da | e08 wa (8-46) Yor a series-compensated system, Eq. (8-50) becomes 
ag _ 1 | 9P1) cin 6 sin (0 — ¢) cs 
da  wo| da 2 = —M X real component of (? ce 
where a may be any parameter K, Ty, or T>. y 1 


Equations (8-45) and (8-46) give 


(0/dai) log G( jam) 
M X real component of | Ee (8-51) 


Example 8-6. For the problem of Example 8-5, determine the maximum modulus 


80 _ 415 cos 30.2° = 0.993 sec™ 
aK 


115. hie 2° = —0.050 and its sensitivity to a change in 7). 
aa ae acne 120.2° sin 30 Solution. For the nominal system, 
dev = —11.8 sec" or, = 2:62 sec"? wm = oo VI — 2 = 0.704w = 7.08 
Ci 1 100 
or BF Se OR EE 
a = 0.352 seo“! 5p, = 0.117 see 2 V1 — 2101 
1 


Modulus and Peak Response of a 


, To calculate the sensitivity of M to T;, we evaluate first 
dulus M is defined as the maximum 


8-6. Sensitivities of Maximum 
Linear System. The maximum mo 


: 100 
. = Onse G(jom) = =—— se oe = 11 —124.5° 
C/R)(jw)| over varying frequency. Since the peak-resp (em) = GF 50.708)(1 +9708) = 1-153 /—124.5° 
| value of |(C/R)(Qe)| | the parameters a; vary, we have Slog fae i, wigs pind 
frequency wm also varies as C a 7 eT, 1+ Tiyan 
ts) . Om (8-4 
aM _ a|Cy, ea we) | Gn bquatt 
a se (jem) | + Pact Re 6) da; Nquation (8-51) gives 
constant wm vad = M X real component of 1.90 /—94.2° = —0.16 sec7! 
i | 
r, as 
However, B Cc (jem) | = 0 8-7. Physical Significance of a Cost Function. Another phase of 
dom | R 7 


error analysis is to take a direct look at the deterioration of system merit 
(ue to component inaccuracies. For a system without power constraint 
(«.g., optimum filtering or prediction) system merit can be taken as 
c (jom) | (8-4 synonymous with negative mean-square error. All we need to do is to 
R examine the increase in mean-square error. For a system with power 
sonstraint, the criterion is not as simple. The calculated mean-square 
error e® can obviously be decreased by increasing @?. The sum e + k’a’, 
liowever, is always on the rise. Its increased value can be taken as a 
\\uantitative indication of the deterioration of system merit. If the con- 
slraint in a? is due to heat-loss limitations, we may regard an increased 
yvulue of a? as an increased cost, since higher temperature reduces the life 
pxpectation of the hardware. The sum e? + ka? represents total cost in 
forms of both the expected error and expected life. It will be defined as 
the “cost function” and denoted by f.. 


by definition of wm, Eq. (8-47) reduces to 
oM te) 


dar dai 


i f the nominal system. 
i 8-48) is evaluated at the wm o aa 
ee cca xe cp is defined as the peak value of the uni 


response c(t). Similarly to (8-48), we have 
dc, _ Ac(ty) (8 


0a; 0a; 


WwW i y Hquw 

here t is the time of peak response of saaoarreer wie at) 
i itivit o that of eva i 

8-49 edu 208 the peak-response sensi y ih 

eas a ty which can be done by the transform method of Se ‘ 
‘) »Upy 0 


—— OOOO 
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Alternatively, the pow 
calculated e? is lower than t 


defined as 


er constraint may be due to saturation. The 


Equation (8-57) shows that 4f. is an approximate representation of the 
he actual @. Let the symbols 4, », and w be 


increase in actual mean-square error éu. 
8-8. Increase of Mean-square Error and the Cost Function. There are 
two types of variations which may cause a change in the mean-square 


oe 
u = actual e é 
error or the cost function: 


» = calculated e? on linear basis 
ited 1. Variations in the spectral densities of the inputs and in the controlled 
system 


2. Variations in the compensating elements 


increase in wu rather than in v. For 


: ; in the ‘ 
Obviously we are interested in considerably below its saturation 


a system with random inputs and a 
value, u can be written approximately as 


u=v+ U(w) 


Variations of the first type change the assumed condition of the optimiza- 
tion calculations. The cost function may be either increased or reduced, 
depending on what is changed in which way. The change in cost func- 
tion is proportional to the parameter change, as long as the latter is 
small enough. Variation of the second type does not change the condi- 
tion of optimization, and consequently the cost function can only be on 
the rise. The change of cost function is proportional to the square of 
the parameter change for small variations. 

So far as computation is concerned, the first type is routine. We 


(8-52) 


. : ing function of w. ae 
where U(w) is an increasing Tu D lected that wis a mininullil 


2 is so se 
+, we assume that the level of a is 
aN gad F(s,ko) denote the corresponding beer a _ and opti an 
i ively. For a slightly different * = %o 
tem function F(s,k) respectively : 
is is a slightly different F(s, ko + 5k), which gives rise to Sa 
variations (6): and (6w)1. Since ko is selected to give mini 


ie Bae He simply calculate whatever we wish to calculate in terms of the variable 
bu = (50)1 + 5 (80) = 0 (8-5 parameters and then take the derivatives. The second type is more 
dw interesting. We shall derive a method which is faster than direct 
: ‘ i have computation. 
; ko) is an optimum system function, we p ‘ ’ 

However, since F'(s,ko) ‘ 2s ( As the mean-square error in a system without power constraint can 
(dv): + ko?(5w)1 = he considered as a special case of the cost function with ko = 0, we shall 

es wi, (8-53) and (8-54) gives consider the cost function only. In general, it can be represented as 

1 [i “4 . 

pie oe ( fo ag / (©) + BF + OF + &,FF) ds (8-58) 


~ aw -j 


Where $9, ®;, and ® are functions of s such that o(s) = ®o(—s) and 


(8s) = &:(—s). The optimum system function Fy is determined by the 
condition 


Next let us consider an arbitrary but small variation —_ a 
optimum system function F (s,ko). Let du, dv, and éw repre 
variations in u, v, and v, respectively. Then 

év = Cie + Ce 
bw = Cze + Cie ( 


C;, and C, are independent of « but de 
p to the second 0 


oe] 


1 [ie ‘ 
ony [+ BFP ide = 0 (8-59) 


where F; is an arbitrary system function with all its poles in the LHP. 
As a special case, F; may be equal to Fy. Since 


where the coefficients C1, C 2% 
ent on f(s). The variation in 5u will be evaluated u 


Dy + Bl + bho + SP Fo = by — Soh oF + (4, + BF) Py 


ine: 
aU 1.°U $F - 
au = 50 + Mb + 5 Sage (00) Feat 
19°U hy integrating Eq. (8-60) and making use of Eq. (8-59) we obtain the 


= (bv + ko? 6w) + 5 Gap CO 


C3? 0U 
na 6fi FT dow? é 


Minimum value of f, as 


S(O) = i a (Po — yl*oly) ds (8-61) 


——————— << TU 
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= F, + dF, Eq. (8-58) gives 


For an arbitrary system function F 
foo $0) + gy fer BRO? Ht 5,F) P| ds 
Se eae c 9 . we f ; 
vie 4 | 7* 5,(aF)(3F) ds (8-62) 
20] j 00 


=2 
e first integral vanishes because of Eq. (8-59), 
i i (8-63) 
=5— éF)(6F) ds 
sf. = fe — 140) = 95 |, HOPMEM 


Since th 


i i 7 

Equation (8-63) is an exact equation. It can be eens nae 
ra in terms of a variation in the transfer function 0 t : i a 
a in terms of the pole-zero locations of F(s). Referring to Eq. j 


we have 6F-! = 3H + 6G-}. Since 
Dake ke 
p= Fo= FP a(g. — 7) = PROOF 


Equation (8-63) can be written as 


I a pi — | ga 8-64) 
if. = a |. bP iF P(SH + 80-)(GH + 2G") ds ( 


—je 


20j 


where Fo is the closed-loop system function of the nominal system. 


| Alternatively, Eq. (8-63) may be written as 


Rees 7° 5.Pol?o(6 log F)(5 log F) ds 


1) —je 


1 [3 y 
<< max \5 log F|? Oni ie Joly ds 


OY Re ( 
< max |é log F|? a UE, Bo ds 


s=jw 


i i must 
The second inequality sign is justified by Eq. (8-61) since f-(0) 


‘tive. Inequality (8-65) has a cer al 
sea that positive and negative unit charges are placed at p 


and zeros in a two-dimensional potential analog, the expression 
max |6 log F| 
s=jw 
. . . is | 
is the maximum change in potential on the imaginary &X1s due to disp 
ments of poles and zeros. The integral 


es ¥ & ds ( 
2m) —jo 


tain physical analogy. If it. 


EFFECTS OF INACCURACIES OF SYSTEM COMPONENTS 211 


is the value of the cost function if the control system is turned off. 

Inequality (8-65) is meaningless if the integral (8-66) is infinite. This 
happens when there are poles of &o on the imaginary axis. However, 
Iqs. (8-63) and (8-64) can still be used. 

8-9. Increase of Cost Function with Steplike Inputs. While the equa- 
tions derived in the preceding section are general enough, their physical 
significances are clouded by the generality of the mathematical expres- 
sions. By working out the solution for the special case of follow-up 
systems with steplike inputs, we shall gain some insight into the magni- 
(ude of the increase of the cost function. The inaccuracies of the com- 
pensating elements may be classified as: 


1. Inaccurate gain or time constants 
2. Neglected minor time constants 


Typical examples of (2) are the time constants of what may be called 
the driving stage of the system. For instance, in hydraulic systems, a 
hydraulic booster or a torque motor is commonly used to supply the input 
to the last stage. Its time constants are of the order of one-tenth those 
of the final power stage or less. In steering systems, the hydraulic servo 
for driving the control surfaces can be considered as the driving stage. 
While it is usually justifiable to neglect these time constants in the initial 
design, we should like to know, at most how much increase in the cost 
function is caused by their presence. Since the optimum system designed 
on the basis of considering these time constants cannot be as good as the 
idealized system, the disadvantage of neglecting these time constants is 
not as great as indicated by the present calculations. If the latter values 
ure low, we have reasonable grounds for neglecting these time constants 
in designing the system. 

lor the system of Fig. 4-1, with H,, = 1, and n(t) = d(t) = 0, we have 


3 24) £ 
®,, + b°®,. = $(1 — A — F) + 2O& pre, 
GiGi, 
‘Therefore, &, = YY6,, 


where YY is given in Eq. (4-27). For steplike inputs, ®,,(jw) = A?/w?, 
and i’) = 1/Y. Equation (8-64) becomes 


1 joo Wee - 2 
af. = 5 sh (- *) FF(6H + 6G-)(sH + 8G-!) ds (8-67) 


—jo 


Now we shall consider variations in G, only. Equation (8-67) becomes 


1 fi |AF 
Wa fi 


2 
= Go log G| ds (8-68) 
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Let G, be written as peeile 1 eek Se oe 
g, _ Kt Tas) + Tos) sal TG ee ee et 
= a Pee Tae tw’ pores | | device _| 


| 
i, _ 3K 8 8} 3 Ts ied PURE Ba ES ail 
Slog G = dlogG@. = Fe + > tate: bs i+ Ts (8-69) | 


A=a,b,.--- 4=1,2,.-% 


We note that, in Eq. (8-68), F/G is simply the closed-loop error to the 
input transfer function. If the inaccuracy is due to gain K alone, 
Eq. (8-69) gives 


of, = & am) (8-70) 


The effect of the time constants can be expressed as 


sT\2 1 «fie | AF |? 1 
of. = (7) oni i Laer| ds (8-71) 


“SG 
where T may be one of the Ts or one of the T)’s. The function inside 
the brackets is always less than unity. Therefore 


if. <e (r) 


Now we consider the case of the neglected time constants. 
G’ is defined as 


Ir ee * 
G (1+ Tis) + Cro asim 1a, 8-4. The variables e(¢) and a(¢) of Fig. 8-3 subsequent to a step input at t = 0 


Q = = Sa 
Go qa a Tis) an Ts) a den Mathematicall 
: at atica we - 
where fT oe Bnd Ty Tas the neglected time constants, large enough cane a ee his — a magnitude of nearly unity for 
pe . small values of s, 1 — G’ i 
and for the signifi ? is nearly zero 
sG-1 = ae -a= 70 -@) (8-4 . gnificant range of 1 — G', F/G ean be considered as unity. 
ixample 8-7. Determi i : < 
BaAnation (8-67) becomes = etermine the increase in cost function to a unit step input, assum- 
1.0f LAF nals 
tj | jo (1 + as)(1 + bs) 


Solution 

Equation (8-75) can be interpreted as shown in Fig. 8-3. AF/ 
gives the Laplace transform of the error signal, while 1 — G’ is anot 
cascaded transfer function before the resulting signal is squared. Ii 
8-4 illustrates typical step responses of F/G and 1 — G’, respectiv¥ 
Since the time constants in G’ are very small, its transient term dies 
quickly. The response of 1 — G’ to the error function a of Fig. 8-4 is 
substantially different from its transient response to the step-input f 
tion c. Equation (8-75) can be approximated as 


1—@ = 8@ +b + abs) 
ay (1 + as)(1 + bs) 
fe = je (a +b)? — a%b?s? 


jaan if 
onj J -j0 1 — as) — Be) @ = 2 (« Pts se ;) (8-77) 


8-10. i 
te eats palais of Sampled-data Systems. Most of the results 
a pete Nee es be readily converted to sampled-data systems 
snclature of Chap. 6 will be used in the prese i . 
19 nt t 
|, Transient Response. Corresponding to Eq. (8-3) icetin, 
’ 


de(t,a) a 1 3 (8, 
da 72 { rio) 2 | Gates) Giis,a) i (8-78) 


7 


1 A? , ; 
neg | - 30 — @’')(l — @) ds 


——— 
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If the sensitivity 
8-78) reduces to 
aK o(z,a) 


dc(nT co) se 2 r@| : | \ zr-1dz (8-79) 
not a=0 207 unit circle a roe 


; i Id 
Since Eqs. (8-38) and (8-39) still ho 
ed by Ko(z,«), the results of the 


ae responding ones for the continuous systems, they will not be illustrated 


by examples. 

8-11. Summary. In Chap. 8, we have studied the various effects of 
component inaccuracies and idealization of transfer functions on the per- 
formance of a closed-loop system. For the transient response as well as 
the performance parameters (e.g., maximum modulus) of a closed-loop 
system, the effect of component-parameter change is a first-order effect. 
Sensitivity functions are defined as the partial derivatives of the system 
response or response parameters with respect to the component parameter 


of c(t,a) at sampling instants only is of intere 


2. Pole Sensitivity of Ko(z,@). 
with s replaced by 2, and (C/R)(s) replac 


analysis also hold in the z plane, namely, 


I—1 “ : 
Opi _ eae a ap (8-80) in question. For the mean-square error or the cost function of an opti- 
Oa "L da5 3 mum system, the effect of parameter change in the fixed component is a 


first-order effect, but the effect of parameter change in the compensating 
elements is a second-order effect. 

To give a more detailed account, the effects studied are the following: 

1. Sensitivity of Transient Performance to Component Inaccuracies. 
A transform method which gives the sensitivity functions directly is 
derived for linear systems. The method is valid in general except at 
(= 0. For ¢ = 0, the method is good if the component inaccuracies do 
not change the order of the system. In case the order of the system is 
changed, the transform method gives the values of the sensitivity func- 
tions at ¢ > 0 but not at ¢ = 0. 

For nonlinear and time-varying systems, a set of linear differential 
de equations is derived which makes it possible to compute the sensitivity 
1 0) = a) ®,(2) 6K(z) 6K(z") functions directly on an analog computer. In case the order of the sys- 

fe = fo — Fel 2m) lem is changed by component inaccuracies, the initial values of the 

< max |d log K(2)|? xf &y(2) = (8-84 sensitivity functions at t= 0+ (obtained by the transform method) 
fe S Wet 2nj . should be used. The initial values at ¢ = 0 lead to erroneous results 
hecause of the discontinuity at ¢ = 0. 

2. Sensitivities of Closed-loop Performance Parameters to Inaccuracies. 
Simple equations are derived giving the sensitivity functions of maximum 
tmodulus, peak response, closed-loop poles, damping factor, etc., to 
inaccuracies of the gain and time constants of various control elements. 

3. Increase in Mean-square Error or Cost Function Due to Component 
Inaccuracies. The cost function of a system with power constraint is 
defined as e? + ko?a?, where @ and a are mean-square values of the sys- 
(em error and constrained variable of the linearized system, and ky? is 
(he optimum value of the multiplier 2. In case the constraint arises 
ul of saturation, it is shown that an increase in cost function is approxi- 
iiutely equal to the increase in mean-square error in the actual system. 

Simple equations and inequalities are derived which give the increase 
in cost function in terms of the parameter changes in the compensating 
tleoments. Physical analogs are described for these equations to illus- 
trate the order of magnitude, 


of Ko(z,e) at the pole pi. 
d saatoiss with H(z) = 0, Eq. (8-80) becomes 


(8-81) 


where R; is the residue 
Tn series-compensate 


opi _ R,* log G-(p) 
0a; 0a; 


3. Increase of the Cost Function. Corresponding to Eqs. (8-58), 


(8-63), and (8-65), we have 

f= hig iev@) + DRO + HOKE 
ie - dz (8.92) 
4 HQKCIKONS | 
(8-83) 


he unit circle, these are also poles 
of the system is to have finite val 
—1), these poles are in recipro 


t 

In case there are poles of Bo(2) on t 
@,(z) and zeros of 5K (z) a e? ee ; 
i @)(z) = Po(z!) and b2(z) = P24 
a ai aah an. Let W(z) and #;(z) be defined as 


Wz) =|] Gd -— ae) 


(2) = &(2)W@)W(e") 


Equation (8-83) can be written as 
° de 
2 


1 
fe = aif ®!,(2) K (2) K(2~") | Wo 5 log K(z) 


lz 
; (ey | bf eK @KE) © 
Te >” Ke@| oni? (2) K (2 : 


< max 
z=1 


re simi the 
As the applications of Eqs. (8-78) to (8-85) are similar to 
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MINIMAL-TIME CONTROL OF NONLINEAR SYSTEMS 


j talk of linear systems and 
~ tion. One hears frequently 
A eas s if control systems belong to either one . the ont 
of the two big classifications. Logically, this is sound. : o-— 7 
conveys the impression of equality, and this is not soun : non 
the same way, we may classify space into two subspaces, a 
’ 


nonlinear systems a 


If we select points in space 
Similarly, no system 


and below certain magnitudes. | 

lassified as away from the origin do 
é Thus, if we consider the class of all nonlinear systems, ‘ . 
of higher order compared with the class of all linear ai e oi ; 
chapter, we do not intend to survey all the problems a aoa s 
studies made on nonlinear systems. Rather, we oo — mc 
problem of universal importance, namely, the prob em 0 bee 
control. One well-known special case of minimal-time ¢ 

; 9 
ictor control, or “‘bang-bang™ servo. — 
ite Seales of minimal-time control arises from the wee rr 
vering problems. Generally speaking, there are ete apaere vil 
laze i ical state of the co » 

1. Stabilizing. The desired dynamica = 
fixed or is varying very slowly, and the control force or moments are 
i ‘ly to counteract the load disturbance. Fe 
are edwin: There is a command to change the or —- 
of tive controlled system from one set of values presi n he? al oa 

i in mini i ith as little ove 
i be carried out in minimum time wl ° . 
oe this phase of operation, the effects of load disturbances 7 
sidered small, and the control force is programmed to carry 
mand. ra | 
Sts optimum control for the first application has been pe in a 
4. We shall discuss here the optimum control ot he 
also be shown how, in a single device, the two principles can be re 
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9-2. The Maximum Principle. The guiding principle of minimal- 
time control is the so-called maximum principle. To put it in ordinary 
language, “If you wish to get there fastest, give it the mostest.”? While this 
rudimentary philosophy may not work very well in human affairs, it 
works in a class of well-defined control problems. In fact, a version of it, 
the bang-bang servo, has been known for many years and studied by 
many authors. A very general and precise formulation of the maximum 
principle has been given fairly recently by a well-known Soviet mathe- 
matician, L. 8. Pontryagin, and his students V. G. Boltyanskii and 
R. V. Gamkrelidze. What follows is a heuristic account of this principle. 


The controlled dynamical system can be described by a system of first- 
order differential equations. 


&=F(xft) ¢=1,2,...,” (9-1) 


where x and f stand for the sets of variables 2, v2, ... , 2», and fy, 
Jo, . » » > fm, Yespectively, and the functions F are continuous and dif- 
ferentiable in x and f. The set of variables x is called state variables as 
it specifies the dynamical state of the controlled system. For instance, 
in a second-order system, position and velocity constitute a set of state 
variables. The set of variables f represents the control forces at our dis- 
posal to effectuate the desired changes in the dynamical state of the con- 
trolled system. In general, the magnitudes of fi, k = 1, YR se 1, 
are limited, but within such limits the f;’s can be varied at will. These 
limits are referred to as constraints on f. It is seen from Eq. (9-1) that, 
if at any instant ¢; the values of the 2,’s are given and the f,(t)’s for ¢ > ty 
ure given, the state variables x; as functions of time from the instant ft, 
onward are completely determined. In this sense the state variables 2; 
completely specify the dynamical state of the controlled system. What 
liappened in the past is immaterial. 

Nquation (9-1) is a very general form of representing dynamical sys- 
lems. Systems with simple nonlinearities and ordinary transfer func- 
lions can readily be rewritten in the form of Eq. (9-1) but not vice versa. 

The n-dimensional space with the x,’s as its coordinates is called the 
phase space. Each point in the phase space specifies a set of values for 
the «’s and thereby represents a dynamical state of the system. As ¢ 
inereases, the x,’s vary continuously, and the point « (whose coordinates 
ure the 2’s) traces out a locus in the phase space. The locus is called a 
\rajectory of the system. Obviously, from any given initial point speci- 
lied by w = aj,¢ = 1,2, ... , m, the path of the trajectory depends on 
the f(s. Beeause of the constraints on f, not every curve originating 


from a@ (short notation for the point 2; = a) is a possible trajectory. 


Let Q(t < 7) denote the collection of all points covered by at least one 


fowmible trajectory originating at a at ¢ = 0 and ending at t= T. Since 


—k << ~-— 
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mie LY: ble 
i i finitesimal amount, the possib 
7s can be varied at will by an in =“ 
pes ene are infinitely close to each other, and 2(¢ < T) represe 
‘nuous region in the phase space. ' —_ 
bi on a reasons T originating at a point a and ending ~ : om & 
Tf no other trajectory can reach 6 from a in less not ait is ca a 
j lection of end points at t= 
mum trajectory. The co i 
trajectories originating from at t a8 0 is denoted by ae" s ae 
eaaue the above definitions in mind, we shall procee 


following propositions: 

Proposition 1. Any part of an optimu 
trajectory. Let us select any two points y an 
T from a to B, as shown in Fig. 9-1. Let 


m trajectory 1s an optimum 
d onan optimum trajectory 
the parts of I between 7, 


ie Neighboring trajectory 


a(tST) 


a5 


u3 


ptimal trajectories. 


Fra. 9-1. The region & generated by 0 


i ition sta 
5. and 66 be labeled 1, Ts, and Ys, respectively. The aa 
that T, is an optimum trajectory from ¥ to é. ; vor ee be ro 
If there is a possible trajectory T) along whic c . tallowill ‘ 
time from 7, then 8 can be reached in less time oe jon eel 
trajectory TiTTs instead of following I. This contr 
Pe wD is at the surface of the wegen Qt 3 ee a 
tively this is easy to see. In Fig. 9-1, A@ < 1 2 pele aid 
‘nts on the trajectories of interval T, and Q(T) is ge : yer 
eerie end points which are not covered by other tra) 
Therefore {(7') must be on the surface of act S 4 — 
Now let us consider the optimum trajectory r es a a a 
erse T', the functions fx) are completely determined & 
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as f,(é). Let IT’ denote an adjacent trajectory (also of duration 7’) 
obtained by letting f.(t) vary by an infinitesimal amount 6f;,(¢) in the 
allowed direction. For instance, if f,(¢) is at one of the limiting values, 
df(t) can only be in the opposite direction, but otherwise 6f;,(¢) is com- 
pletely arbitrary. Let X(t) and X(¢) + 6x(¢) denote points on T and I”, 
respectively, at time ¢. By definition, the end point @’ of I’ is a point in 
Q(t < T), and 6x(T) = 8’ — B. 

With reference to Fig. 9-1, let the vector N denote a normal of the sur- 
face bounding Q(¢ < T) pointing outward from 8.¢ Then 

N- 6x(T) = N,62,(T) < 0 (9-2) 
i=1 

Inequality (9-2) holds to the first order of 6x(T). The equality sign 
holds if 6’ is also a surface point. Taking an infinitesimal variation of 
lq. (9-1) gives 


=) ie b(t) +) Ge): iit) 2=1,2,...,0 @8) 


‘The partial derivatives dF;/dx; and 0F;/ df, are evaluated for x = &(t) and 
f = f(t) and are independent of the variations 6x and df. Equation (9-3) 
is correct to the first order and is linear in 6x and 6f. In arriving at a 
solution of Eq. (9-3) there is no need to treat 5x and éf as infinitesimal, 
since the solution of a set of linear equations is independent of the magni- 
tude of the variables. 

Let us consider a time-variant linear system represented by a set of 
differential equations with u,(0) = 0 and 


a(t) = } gam) +o) ¢=1,2,...,n (9-4) 
k 


where u;(¢) and v;(f) are the output and input variables, respectively. 
secause of the superposition theorem, w;(¢) can be expressed as 


wits) = Df, hin(ts,ton(e) at (9-5) 
k 


where hix(t:,t) is the response of output variable wv; at t; due to a unit 


impulse of v,.at ¢ Equation (9-5) is a multivariable version of Eq. (5-1) 
und can be derived in the same way. 


| The normal N may not always exist. For instance, 8 may be located exactly on 
an edge of the region Q(t < 7). However, the likelihood of this happening is negligi- 
hly amall, For linear systems with saturation of the type described in Sec. 9-3, it can 
readily be shown that the region Q(7') (the set or collection of all end points of tra- 
jectorics of duration 7') is convex. Then there is at least one vector N which satisfies 
inequality (9-2) under all circumstances. 


———————— Te 
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Now let us consider the function hiz(ti,t2). By assuming v(t) to be 
impulses at t = te, it is easy to show that 


Ohix(ta,t 
Shatin’) = > Giz (t1) jx (tr,te) 
1 


and hiz(to+,te) = Six 


However, of greater interest is the variation of Aiz(t:,t2) with tf. Equa- 
tion (9-5) holds no matter what form the functions v; take. From this 
fact and Eq. (9-4) this variation can be readily determined. 

1. Let v,(t) be a unit impulse function at f:-, and all the other v(t) = 0, 


ik. Equation (9-4) gives 
u(t) =a us (ty) =U tA k 
Equation (9-5) gives 
s(t) = hie (ts,t1-) 
Therefore hiz(ti,t1-) = a. bari (9-6) 
; eats 0 ifiFk 
Equation (9-6) holds for all 7 and k and is the boundary condition 0 
hix(ta,t). 
2. Let v,(t) be an arbitrary set of impulse functions at ¢ = 0. The 
solution is u,(t) = yi(¢), where y(t) satisfies 
gilt) = Ygu(y() +> 0 (9- 
c , 
Suppose that, instead of the impulses at ¢ = 0, we have input impul 


at t = te- (0 < t < t:) with moments yi(ts). Then us(te) = yi(te), and 
the solution from t: onward remains unchanged. Equation (9-5) gi 


wa(ts) = yilts) = , Pan (taste) ye (ts) @) 
k 


Since Eq. (9-8) is to hold for all values of t2, we may differentiate it wi 
respect to tz. The result is 


ee rs dhalbst) yn(ts) + ee hin (tata) ye(te) 
k 


k 
- ere yn (te) + : hatte 


The second equality sign is due to Eq. (9-7). We may call the du 
indices L and k instead of k and L without changing the double su 
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tion, and the above equation becomes 


» E= + : hulttdona(t) yx(t2) = O (9-9) 
- £ 


Since the initial-impulse functions are arbitrary, the functions y;(tz) are 
also arbitrary. Equation (9-9) implies that ; 


Ohix(t1,t2) 
clay "7 te aa @14(to) hex (t1,te) (9-10) 


L 


a (9-10) together with Eq. (9-6) determines the set of functions 
ik(li,t). 
Equation (9-3) is of the same form as Kq. (9-4), with 


gii(t) = (=), 


“= 2G). 


k 


Therefore, Eq. (9-5) gives the solution as 


6a,(T) = 2 ” " hes) “s (ey afx(t) dt (9-11) 


Inequality (9-2) gives 


N- 6x(T7) = a » » 2 Nihii(T,t) (22), Ofi(t) dt <0 (9-12) 
e sok 


Let d;(t) be defined as 
A; (t) — 2, Nihi(T,t) (9-13) 


liquations (9-10) and (9-6) give 


aa Youre mee py (2) 200 (9-144) 
; 


A(T) = N; (9-14) 


age (9-14a) and (9-14b) specify \,(7’) completely. In Eq. (9-12) 
(the 6f,(t)’s for different k and different £ can be independently selected 
und the necessary and sufficient condition for satisfying Eq. (9-12) is 


or; k= 1,2 
y j FM ee 9M 
» (37) Wi) SO forg oye" (9-15) 


’ 


— OOOO = — — 
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Equation (9-15) has a simple interpretation: Along any given optimum ’ 
Equations (9-14a) and (9-14b) become 


trajectory, with a(t), t= 1, 2,---5 % considered fixed, the function 


H(t,4,f) = Y MOFID (9-16) u(t) = < 
Z TG a ajNi j= 
i j > Qi; hte) ee 7) (9-194) 
is a maximum tf f(t) = Flt), = 1, 2.4 a0. ™ A(T) = Ni ch ile. /n (9-190) 
The above has shown only that H (t,2,f) is a local maximum. A ‘Thin Pewee hawaen funn te 
stronger version can be proved as follows: Suppose that f(¢) = f(t) : 
except in the interval t. — 6 <t <b Equation (9-1) gives - yy ea Sun en 
a aijdi(t)aj(t) + bidi(t 
bas(ts) = {PAB(tr-),E(lo-)ste-] — PARC) Aa) bo) ot oo ee I. ne 
eka mitt) eatdatioe With the constraints (9-18), H cannot be maximum unless 
Pa OF; a 
si > (2%), aso ce fit) = sign of [ yi birdi(t) | (9-20) 
i=1 


| j 


The solution at t = T is 
bai(T) = Y, his(Tyt2) das(te) 
ij 


Wee (9-20) shows that the solution represents a so-called ‘“bang- 

ang system, Maximum control effort in one direction or the other i 

always applied. ee 
r , ‘ 

% be ipintss systems with only one input fi, the transfer function from 
, a 1 can be ee substituting s for d/dt in Eq. (9-17), setting 
2=J,=::-:* =O, an solving f i i 
ton Wea g for the ratio 21(s)/Fi(s). The denomi- 


D(s) = det |Ja;; — 6;;s| 


Therefore 
N.- 6x(T) = Y MP LEE) fb) te — F{&(to-) E(te-) ,t2-1} 6) 
= (A (to-££) — H(te-,&,B)] at 


The ; 

hn a; of D(s) are called the characteristic roots of the matrix 
ii||._ ‘The same roots with reversed sign are obtained in solving a simil 

expression for Eq. (9-19a). i 
The solution of Eq. (9-19a) is 


Since N- éx(T) <0, the above equation shows that H(t2-,&,f) is an 
absolute maximum. As 2 is arbitrarily selected, f = f gives the absolu 
maximum H at every t. 
Thus by assuming an infinitesimal time interval 8(¢) in which f ma 
differ from f by a finite amount, we have proved that f gives the absolu 
maximum H at every t. 
This is the essence of Pontryagin’s maximum principle. 
H(t,4,f) is called a Hamiltonian function. 
9-3, Minimal-time Control of a Linear 
controlled system can be described by a set of linear differ 


The functio A(t) = y bad) = ¥ Ajenait 
t=1 i=1 


q i 

son — -_ real, X(t) crosses the zero line or ¢ axis at most n — 1 
? rai Pes ows oa fi reverses sign at most n — 1 times. If some 
ried ‘ re complex, d(t) may cross the ¢ axis considerably more than n 

‘s in an interval 7 large compared with the period of oscillati 
wime holds for the number of sign reversals of f1 wee: 
% . aeons the optimum solution of the control force f; is always at 
aetteies 3 ee value until the desired steady-state condition is 
pons ; : whict fi: = 0. If all the a;’s are real, or if some of the a;,’s 
omplex ut 7 is less than half the natural period, there are, at eee 
(« n~ 1 sign reversals of fi. If some of the a;’s are complex and Tj i 
(he number of sign reversals of f; may be more than n — 1 pial 


System with Saturation. i, 
ential equatio 


j=n k=m 
= SF amt SY dae §-12---.9 
j=l k=1 


In Eq. (9-17) fx are the control forces or the independent variah 
through which the control function is performed. By properly soull 
the constants bi, the constraints on f, can be written as 


fil St ko1,2,.66 5™ 


—— EO —— “(~~ 
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Example 9-1. An inertial system of negligible damping is to be moved from one 
angular position @ to another in the least time. The inertia of the system is J, and 
the maximum available torque is K. Determine the optimum form of fi(t). 

Solution. While the problem is a simple one, it can be used to illustrate the method. 
Let a: and x2 represent @ and 6, respectively. The system equations of motion can be 
written as 

at, = Xe 


: K 
aga : 
The applied torque is Kfi(t), and the limits of f: are +1. Equation (9-19a) gives 
(9-21) 
From Eq. (9-21), 2 can be solved: 

re(t) ba! Ai + Ad 


AW) = buds) = (A + Aa! (9-22) 


Only one sign reversal of f: is possible. To satisfy Eq. (9-11), the sign reversal must 
occur at { = 7'/2. The change in 0is KT?/4J. 
Example 9-2. Asa second example, consider the system described by 


6 +266 +0 = Ifi® 

\fi@| <A 
For any initial value of 6 and 6, the system is to be brought to rest in a minimum time, 
Solution. Let a and x2 represent @ and 6, respectively. The equations of motion 


are 
“, = 2 
by = —a1 — 2$a2 + bf) 
Equation (9-192) gives 
Ns = re 
ho = —Mr + 2or2 


Solving the above equations,we obtain 
A(t) = drat) = Aes sin (J/1 — ¢t + 9) (- 
In Eq. (9-23), A and ¢ are constants of integration. We note that A has no effeet 


the positions of zero crossing, for the time interval between two zero crossings is alW 
the half period r/V/1 — 5?- Consequently, for large T, f: must reverse every 
period. Physically, large 7 means a relatively weak control force bfi. To br 
an oscillatory system to rest with a small force, the applied force must be pro 
phased. 


9-4. Predictor Control of a Second-order System. If the control 
system is linear, it is quite feasible to use the system error and © 
derivatives as the state variables instead of the controlled variable and 
derivatives. An advantage of this is that the desired final state is al 
at the origin of phase space, with the resulting simplification in electron 
Section 9-3 shows that this point should be reached by applying maxi 
control effort all the time, either in one direction or the other, 


derivative versus error of the controlled system, 
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way for realizing these ideas is the i 
predictor type of cont i 

by phase-plane or phase-space criteria. 2 Oe oe 

In ie McDonald published his paper on the optimum control of a 
second-order system by phase-plane criteria. Essentially the same idea 
was a 4 paper submitted by Hopkins to the AIEE at about the 
same time. ork on systems of third order and hi 

igher was re i 

— years by Bogner and Kazda, Silvia, Rose, and others vec 
a onsider the second-order system discussed in Example 9-1, with the 

e assumption that the reference input 6; is a linear function of unas (step 
ramp). Once the error and error derivatives are simultaneously zero oan 

? 


=é 


Fia. 9-2. Trajectories in phase plane e-é. 


have the desired steady- iti i 
othe! eady-state condition, and the applied torque can be 


In terms of the error e = 0; — 6, the equation of motion is 


de K 
| ental 5.43 (9-24) 
eee a ee 
a A fi +A,=—- + full — to) + (5)... (9-25) 
e=- Xft + t)' + (5), (t — to) + e(to) (9-26) 


The phase- i j i i 
| e-plane diagram of Vig. 9-2 is obtained by plotting error 
For each set of initial 
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conditions, the error and error derivative vary with time, as given by 
Eqs. (9-25) and (9-26), and trace a curve on the diagram. Each curve is 
called a trajectory. There are two sets of trajectories in Fig. 9-2, cor- 
responding to the two values of f1: The solid curves are for fi = +1, and 
the broken curves are for fi = — oe 

The life history of a system in operation is traced by a sequence of 
segments belonging to the two sets. The sequence of curves AB, BC, 
CO represents a system having an initial error é:,and f: = +1, —1, and 


+1. As soon as the representative point of the system reaches 0, both 
e and é vanish, and fi is made to 
ba vanish also. 


From the discussion of Sec. 9-3, 
we see that ABCO does not represent: 
an optimum system, since there are 
two changes of sign of fiat B and C, 
respectively. AB’O is the only tra- 
jectory that reaches the origin with 
one change of sign in f,, and cons@= 
quently it represents the optimum 
system. 

There are two trajectories which 
lead to the origin: PB’O and QCO, 
The two trajectories join at the origin 
and can be regarded as a single 
curve. One way of making an opt: 
mum system is to make the cury 
PB'0CQ aswitching boundary int 
sense that f: = +1 ifthe represen 
ative point is above the switching boundary and fi = —1 otherw 
There is no need to define the value of fi on the switching boundary 
Consider a system with an initial error ¢1. Since the representally 
point is above the switching boundary, f: = +1, and the point trave 
along AB’ until it crosses the switching boundary at B’. Theoretical 
an infinitesimal amount of crossing is sufficient to reverse the sign of f 
and the representative point travels along B’O to the origin 0, Mi 
system error as a function of time is plotted in Fig. 9-3. Similarly, wi 
a ramp input, the representative point starts at P’ and traces Pan 
The system error versus time is plotted in Vig. 9-4. 

9-5. Physical Realization of an Approximately Optimum Second-or 
System. The system described in Sec. 9-4 cannot be realized oxime 
because of its requirement of infinite gain. Theoretically, the swité 
boundary is infinitely thin, It takes an infinitesimal change in ¢ or 
the boundary to change fi from maximum positive value to mx 


0.5 


0 t 


Fic. 9-3. System error subsequent to 
a step input. Solid curve, idealized 
predictor-controlled system; dashed 
curve, actual predictor-controlled sys- 
tem with finite gain. The points A, 
B’, and O correspond to the same points 
in Fig. 9-2. 
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ieseaad : ; hen es 
gative value, or vice versa. While an infinite gain is not possible, it is 
? 


also not essential in the o i 
peration of the system. i i i 
well-designed system is not appreci- : peel 


ably slower than the ideal system. . 
Consider the lead-compensated 3 

system of Fig. 9-5. The torque fi 

is zero if e + T(de/dt) = 0. The 

switching boundary of the system is 

shown in Fig. 9-6. There isa linear 

region about the neutral boundary 

e+ T(de/dt) = 0in which f; is pro- 

portional to e + T'(de/dt). Either O ¢ 
beyond AA’ or below BB’ the sys- : 
tei dk sabumiadeeatl fy Samale 1 Fig. 9-4, System error subsequent to a 
and a, respectively. As the sys- eee as 

- gain is increased, both AA’ and BB’ move toward the neutral bound 
ary and merge with the latter in the limit, as the amplifi i oa 
nee , plifier gain approaches 
P Nic ot croteh ied atc pede see bal et parallelism between 
control can be obtained by bendin Gas iin diye 
controlled system. The Sid iee term ee ie a ee 


Torque 
saturation 


sere . 
Fie. 9-5. Block diagram of a second-order system with lead compensation 


eR: the straight error term is made more effective at low error 
et a Stioens tye by introducing nonlinear elements into the 
rere a in Fig. 9-7. While two nonlinear elements N1 and 

» shown, one is usually sufficient. For instance, we may have a 


straight linear element for N1 1 
nt in leme and a nonlinear el i 
following input-output relationship: see SP 379 ae 


Vout ed = V vial 


i Alternatively, we may have a straight linear 


(9-27) 


with the same sign as v 


—— Lx —————  é&8 oO 
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element for N2 and a nonlinear element for N1 with 


Vout = Vin Vial (9-28) 


Of course it is also possible to use two nonlinear elements to give the 
same over-all effect. 

With finite amplifier gain, the ideal switching boundary should not be 
used as the neutral curve. Rather, the boundaries themselves should be 
part of the saturation limits, as shown in Fig. 9-8. Part of the ideal 
switching boundary AB becomes the upper saturation limit, while another 


Fic. 9-6. Equivalent switching boundary of a second-order system with lead com- 


pensation. 


Torque 
saturation 


ia. 9-8. Phase-plane diagram of an actual predictor-controlled system with finite 


Matched diodes gain. Diagram at upper right corner shows magnified view near origin. 


or thyrite 


part CD becomes the lower saturation limit. In the region BECF the 
system is linear and not saturated. Obviously, Eqs. (9-27) and (9-28) 
are modified to realize the switching zone of Fig. 9-8. 

Jonsider a system with initial error e;. At first, full f, = +1 is 
applied. As soon as the system trajectory enters the switching zone at 
HI, f; is reduced in magnitude, and there is less change in é. Conse- 
quently the trajectory moves toward the horizontal direction and bends 
over gradually until finally it coincides with DC. By now fi = —1 is 
applied, and the representative point follows trajectory DC until it passes 
c. At this point the trajectory for f; = —1 separates from the lower 


To amplifier 


a (b) = 


of a predictor-con trolled system, (a) Block ding 
linear control (the clements enclosed by broken linow) 


Fia. 9-7. Physical layout 
(b) possible circuitry for non 
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saturation curve, and the representative point of the system enters into 
the linear region. It follows a path somewhere between the lower satura- 
tion curve and the trajectory for fi = —1, as illustrated by the broken 
line. The system-error-versus-time curve is shown as the broken curve 
in Fig. 9-3. 

9-6. Phase-plane Configurations of Second-order Systems with Vari- 
ous Pole Locations.+ The trajectories and optimum switching boundaries 
depend very much on the locations of the poles of the controlled system. 
We shall study the following cases: 

1. Real roots, stable system. Typical example of root locations: 
0, =—L 

2. Real roots, unstable system. Typical example of root locations: 
0, 1. 

3. Complex roots, stable system, ¢ > 0. Typical example of root 
locations: —¢ + jv — ¢. 

4, Complex roots, unstable system, ¢ < 0. Typical example of root 
locations: —¢ +7 V/1 — £2. 

In the above, we are referring to the controlled system only, not to 
the entire servo loop. Because of the existence of nonlinearities in the 
closed loop, the latter cannot be characterized by root locations. 

It is worthwhile to make a detailed study of second-order systems. 
These systems are simple, and many of the phenomena we observe here 
can be carried over to higher-order systems. : 

The differential equation for cases 1 and 2 may be written as 

dc 

dt? 
where 7 = 1 for case 1 and —1 for case 2, and f = +1 for an optimum 
system. In terms of the error ¢ = ¢: — ¢, where c; is a constant, Hq, 
(9-29) can be written as 


dts 
ttn" = J (9-29) 


de de 


gp tq I 
Let x = e, and y = de/dt; Eq. (9-30) becomes 


(9-30) 


dy ste 
dt +n =—-f 
Replacing dt by (dx)/y in the above and rearranging terms, 


y dy 
f+ 
+ After N. J. Rose, Optimum Switching Criteria for Discontinuous Autom 


Controls, [RE Conv. Record, pt. IV, vol. 4, 1956, and H. 8. Tsien, “Dnginee 
Cybernetics,” pp. 154-157, McGraw-Hill Book Company, Inc., New York, 1954, 


= —dz (M31) 
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Integrating Eq. (9-31), we obtain the equations for the trajectories: 


iy? | f 
me t+y= log (u a ‘) + constant (9-32) 


For case 1, n = 1. We obtain the two sets of trajectories as shown in 
Fig. 9-9. The optimum switching boundary is given by the two trajec- 


— oe and BO. ‘The case is not substantially different from that of 
ec. 9-4. 


Fig. 9-9. Phase-plane diagram with roots of controlled system at (0,—1). 
bi case 2,7 = —1l. We obtain the two sets of trajectories as shown 
a Vig. 9-10. The optimum switching boundary is given by the two 

anni AO and BO. However, if |y| > 1, all the trajectories are 
ivergent, and there is no way of bringing the system back into the |y| < 1 
none. Physically, the divergence is expected, since with |dc/dt| > 1, the 
negative damping force is larger in magnitude than the control force f. 
lhe differential equation for cases 3 and 4 can be written as 
d’c de 
wteate=s (9-33) 
Jae * H 
W f shall study the optimum system for reducing c and ¢ to zero from some 
initial values. Solving Eq. (9-33), we obtain 


%™¢ =ft Ae7$ sin a ap + ri) 
y =¢ = Ae cos (V1 — + + o1) 


where A and ¢ are constants of integration and #, = sin=' ¢. 


(9-34) 
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Fig. 9-10. Phase-plane diagram with roots of controlled s 


Fic. 9-11. Phase-plane diagram with a pe 


ystem at (0,1). 


1ir of complex roots. 
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For case 3, the trajectories are spiral-like curves converging on ( f,0). 
The set of trajectories with f = 1 are shown as solid curves, and the set of 
trajectories with f = —1 are shown as broken curves in Fig. 9-11. The 
trajectories A A’O and BB’O are two parts of the optimum switching 
boundary. However, the optimum switching boundary does not extend 
beyond A and B since it takes more than one-half cycle for points beyond 
A and B to reach O, and it has been shown in Example 9-2 that f cannot 
last beyond one-half cycle without changing sign. Consider the point P, 
for example; it is faster to start with f = —1 and to reverse f at A’ than 
to have f = 1 all the way. It takes less time to traverse the trajectories 
PP’A’O than the trajectory PAA’O. 

To determine the rest of the optimum switching boundary, consider, 
for instance, the trajectory CC’B’ which intersects BB’O at B’. Let C” 


ia. 9-12. Optimum switching boundary and typical trajectory of a controlled system 
with a pair of complex roots. 


be the point on the trajectory CC’B’ from which one-half cycle is required 
to reach B’. C’’ is a point on the optimum switching boundary. If we 
move the point B’ from O to B, the point C’’ traces the are AC’ D. It 
can be considered an extension of the optimum switching boundary 
OA'A, If the above process is repeated, the optimum switching bound- 
ary of Fig. 9-12 is obtained. Starting with any point P, the trajectory of 
an optimally controlled system is also shown in Fig. 9-12. 

For case 4, most of the above discussion still holds. However, the 
trajectories of the system are outgoing spirals moving away from the 
points (1,0) and (—1,0). The optimum switching boundary can be 
obtained in the same way as in case 3, resulting in two series of nearly half 
circles of diminishing size, as shown in Fig. 9-13. The circles converge at 
two points (+a,0), where 


—— EO i". 
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2. The desired final state is a free-running state of the system: 


from (a,0) with f = 1 passes the point (—a,0), 
from (—a,0) with f = —1 passes the point 
tories mentioned above enclose a region in 
Outside this region, the 
vergent tendencies of the 


The trajectory starting 
and the trajectory starting 


(a,0). The two critical trajec 
tem is convergent. 


k to overcome the di 


which the controlled sys 
control force f is too wea 


controlled system. ve j=n 
w= by Gijhrj = 1, 2, Aa > 0 
j=1 “ 


Let a set of error variables y; be defined as 
Wer—t% t=1,2,;..,n (9-35a) 


From Eq. (9-17) and assumptions 1 and 2, we have 


; j=n 
Yi GYy — OF 
Ds aiid — Disha (9-36) 


i=n 
@€@=Ci -c= hy: 
2, hw (9-37) 
In the following analysi ; 
: ysis, we shall divide the systems i 
a “a ne characteristic roots of the Seana? lac ; bi 
: e characteristic roots a; are disti oe 
th ; : i istinct. Let y and 
Me See bine yry2 °° * Yn and biber + * : ¥ hi oe 
squar : eave matrix with hihz + + * hn as its RYN oe a vd ii 
square matrix with a; as its elements. Equations (9-36 , an the 
be written as ) and (9-37) can 


: = eke ‘ (9-36a) 
sf ea eS (9-37a) 


Fia. 9-13. Optimum switching boundary of a system with a pair of divergent complex 


roots. 


Since the characteristi 
ce roots a; of th i isti i 
eee ee the matrix A are distinct, there is a 


9-7. Predictor Control of Higher-order Systems. The general co 
a = TAT 
(9-38) 


siderations for predictor control of a system 


about the same as the ones men 
nth-order system to be in the desired steady state, 


only zero error but also zero error derivatives up to the (n — 1)st orde Bax 
Therefore, the simultaneous zeroing of n variables is required. 8 * Ae (9-39) 
phase-plane technique is carried over into a phase space of n dimension wrtiiieha a b (9-40) 
as will be described presently. ultiplying the left-hand side of Eq. (9-36a) b 
FEDS T-1 : 
The controlled system is des ) by T~!, we obtain 
controlled variable c is a linear com 


is diagonal. Let the i 
we need to have ti defined by set of variables u and the column matrix § be 


cribed by Eq. (9-17). In general, t 


bination of the state variables; a = au — Gf: (9-41) 


' a i ) T1X Kx 9-4 Cc 
WNCE 5a diagor al mat 
1 . f 
, | ( 1) an be written in terms of its 


i=n 


c= y, hits 


u = ~ 1 : 
OU Bil oh IY 2, Miers (9-42) 


— LEO “~-— 
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The solution of Eq. (9-42) can be written as 


Ui = e Sa (ws ca ® p) exst—te) 


(9-43) 


io 1 initial value of ui at ¢ = bo. 
Ne Tc savie u; are called normalized coordinates, since be 
variables u; in Eq. (9-42) are separate whereas the variables yi in ~ 
(9-36) are not. Because the transformation matrix T 1s none 
simultaneous zeroing of the variables wu; 1s equivalent i simul ey : 
zeroing of the variables yi. Therefore the set of variab s Ui, 0 e ; 4 : 
_.. 4m, can be used as coordinates of a phase space, and ¥ e sien" a 
phase space represents the desired steady-state ee a. quation oz 
represents the parametric equation of a trajectory. iven bee he 
initial values wio, 7 = 1,2,--- 5% 4 trajectory is traced ba a —7 
The optimum switching boundaries in phase space u; can 7 a 
by a reversed time technique as follows: The final or (n - )s a 7 
boundary is composed of points on either one of two vpieatarone - ui ei 
pass the origin in some future time. These points can : te) ke } be 
tracing back from the origin. Let us say that the trajec eee a 
through wio at to. At some prior time t, t < to, the aie a va 
is at P, with its coordinates u; given by Eq. (9-48). Le ee ‘< a 
That is, 7 is the interval of time which te tr ae in 
system takes to reach the point wio. Equation (9-43) can be 
u(r) = Ph + | Uio — & i.) (tae (9-44) 
For the final switching boundary, the destination is the origin, and wo = 0, 
Let these points be denoted as 


u(r +) = 2 — **) 


Bi es 
P a! ee } —e€é O5T 
ury—) = — 5 ( ) 
The next to final or (n — 2)nd switching boundary is a surface one 
by trajectories which will intersect the final switching bow “fi tal 
future time. Obviously, in order that these trajectories do * ee 
with the final switching boundary, fy: must reverse sign at the poin 


j i ing 0 
intersection. The surface generated from trajectories converging 


uit, +) is 
u(t1,723+,—-) = — es (1 — e~™") + uj(t1,-+)e~™"* 


= Bi |-1 + 2% a enn | 
ay 
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Similarly, the surface generated from trajectories converging on u;(r, —) is 


tu(ryres— y+) = FEEL = 2e-an + enor] (9-48) 
Equations (9-47) and (9-48) together give the parametric expression for 
the (n — 2)nd switching boundary. 

The process can be continued until we have obtained all the switching 
boundaries including the first, which is an (n — 1)-dimensional continuum 
in an n-dimensional phase space. The optimum switching boundaries 
so obtained are always good for a region near the origin. If the charac- 
teristic roots are real and negative, the optimum switching boundaries 
are good for the entire phase space. If the roots are complex with non- 
positive real components, at far-away regions the above switching bound- 
aries may not be optimum but the system trajectories still converge. If 
the roots have positive real components, the system trajectories at far- 
away regions diverge. 

2. The characteristic roots a; are not all distinct. The problem can be 
treated as a limiting case of a system with all distinct a,’s, but some of the 
a,’s are approaching each other in the limit. However, direct solution of 
the differential equation involved is usually simpler. This is especially 
true when the multiple roots occur at s = 0. 

If the transfer function of the controlled system is expressible as a 
constant over an nth-order polynomial D(s), the error and error deriva- 
tives up to (n — 1)st order can be used as coordinates of the phase space. 
The differential equation of the system is 


n 


die 
D(p)e = » a; 53 = bf 


j=0 


(9-49) 


If e and all the time derivatives of e up to the (n — 1)st order are simul- 
taneously zero and if f is also made to vanish, the nth derivative is also 
vero, and there cannot be any changes of the values of e and derivatives 
of e. Therefore the origin of phase space represents the desired steady- 
state condition. In the solution of Eq. (9-49), the error and error deriva- 
lives are expressed as functions of ¢. The switching boundaries can be 
obtained in the same way as in the previous case by the reversed-time 
tochnique. 

If the transfer function of the controlled system is N(s)/D(s), where 
hoth N(s) and D(s) are polynomials of s, not all the error derivatives are 
continuous when the sign of f is suddenly reversed. Consequently these 
derivatives are not suitable coordinates of the phase space, as the repre- 
sentative point in such a space jumps at switching. A method to remedy 
(his situation will be discussed next, 
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9-8. Switching Discontinuities in Phase Space. Let the transfer 
function of the controlled system be 


) b;s? 


Ge) = = (9-50) 


For a unit step change in fi, there is a corresponding step change in the 
(n — m)th derivatives of c. Its magnitude can be evaluated by the limit 


theorem: 
(=) =i [aro S| = bm 
dt-™ t=0+ eae $s An 


Since the desired output c is either a constant or a power of t, the same 
step change will appear in the (n — m)therror derivative. It also implies 
that the higher derivatives are not even finite at the moment of switching, 
As the operation of predictor control is based on the assumption that the 
representative point remains at the same position before and after switche 
ing, the time derivatives of c of (n — m)th order or higher are not suitable 
as coordinates of the phase space. We shall find a set of variables that 
are suitable. 

Equation (9-50) can be expanded as a power series of 1/s: 


G(s) = » ks z (9-51 


where the coefficients k; are zero fori <n —m. A set of variables 
are defined by 


Us = ¢1 — e+ kofi = e + kof 
tgs = St bef 1 =e Os Seer 


The variables u; are shown to be continuous as follows: From Eq. (9 
ty = Cy SH a kof (9 
Taking the Laplace transform of the above equation, 


S{u — ei} = (- Ee ks) Py(s) = (- y4) Fis) @ 
i tal 


i= 


Each sign reversal of f; is equivalent to a step change with /'\(8) = 4 
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The immediate change in uw; — ¢; can be calculated by the limit theorem: 


6(ui — = li = a ae 
(ui — 1) are o( by ‘) ce 0 (9-56) 
The variable wz is defined by Eq. (9-53): 
du 
U2 = > a + kif 
dey d(u C1) 
we at hh 


£ {us i aa} i (-: y Ss 7m ks) F,(s) 


Repeating the above steps, we obtain 


: ay ac, % ky 
£ fe fl = (- » ) Fs fee. Red eee 


rom ps (9-57) and the limit theorem, it can be readily shown that 
Ui — (d'ci/dt’), 1 = 1, 2,...,n—1, is continuous at the moment 
of switching. Since c; is either a constant or a power of ¢, the tim 
derivatives of ¢; are continuous. Therefore, all the veriahl : 
(=1,2,... ,n, are continuous. ve 

In the phase space with coordinates Ux 
ut the same position while fi 
When the representative point 
error and all its derivatives vani 
space criteria are met. 

The discontinuity problem arises out. of the 
sentation of the system. If the system dynamics is put in the form of 
Niqs. (9-36) and (9-37), the set of variables y: is always continuous at 
switching, since the step change in f; is absorbed by the 9; term. The 
not of variables yi or any set of variables which is cheained by a non- 
vingular linear transform of yi can be used as phase-space coordinates 


the representative point remains 
reverses sign or changes from +1 to 0. 
reaches the origin and f is removed, the 
sh. Thus all the requirements for phase- 


transfer-function repre- 


Fee! 9-3. Consider the aircraft-pitch control problem described in Sec. 4-7 
ransfer function from elevator angle 6 to the lincar acceleration a in pitch di 
fion can be written as pce 
A (s) re k's? + k''s + w,*k 
A(s) 8? $F fans Fb wy? 8-0 


———— 
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where the constants k, k’, k’', and ¢ are expressible in terms of the constants in Eq. 
(4-102). The elevator is rate limited: 


|| < Om (9-59) 


Determine a set of suitable variables for phase-space coordinates and the optimum 


switching boundaries in the space. 
Solution. Let f denote ds/dt. The transfer function from f to a is 


A(s) k's? + kl's + wn2k 


F(s)  8(8? + 2f0n8 + on”) om 


Obviously a is continuous at switching, and so is the error ¢ = 1 — 4, where a; is the 
ordered value of a. Since this is a third-order system, the desired steady-state condi- 
tion is represented by simultaneous satisfaction of the following equations: 


e=0 5 = const (9-61a,b) 
de _ ae 
ai =0 de =0 (9-61¢,d) 


Since a is a constant, Eqs. (9-61c,d) are equivalent to 


da da 


7 2 
From Eqs. (9-52) and (9-53), 
uwm=e 
us -“ ~Kp= Fo — k’'d) 


A third variable ws can be obtained by a straight application of Eq. (9-53). 
it is interesting to use a different method. Equation (9-58) can be written as a diff 


ential equation: 


At the desired steady-state condition, da/dt vanishes and disaconstant. If it is 
given that a — ké = 0, Eq. (9-64) ensures da/dt? = 0. Since both a and 6 are 
tinuous at switching, we may use @ — ké as the third variable: 


(« 


uz; =a — kb 


A more direct way of arriving at a set of phase-space variables is by writing 
dynamical equations in the form of Eq. (9-17), with a, g, and 6 taking the place of 


£2, and 23: 

2 ee aon 
7 -uvetttmuy’ 
Mf  -Ha- Fats 
dé 
thee 

= ch crs 
+ ae 


MINIMAL-TIME CONTROL OF NONLINEAR SYSTEMS 241 


Equations (9-66) to (9-68) are th 
Wa aia ) are the same as (9-17), and Eq. (9-69) is the same as (9-35). 


TR 1 Cr 
det |a;;| tn i 
et |ai;| = cu Geen =: 
a - vi a (9-70) 
0 0 0 


Setti ight- i 

ate — ~~. ee side of Eqs. (9-66) and (9-67) equal to zero, two equatio 

i i OF : are obtained. From these equations and Eq. (9-69), the st mn 
ues of ai, gi, and 6), corresponding to a given ai, can be daleolated: Paes 


n=F 
a=F 
_M ; M k’ 


The variables y1, y2, and ys are defined by Eq. (9-35a): 


CL 
am ey. a 
St 2" 9 ->¢ 
ead eae ee ee 
3 a1 I 5 


Any nonsi i 
tt ‘f pease eon bso ge of the y’s are adequate for phase-space coordinates 
Baa (931 ant a at Ui, Ue, and Us are linear combinations of yi, yz, and y ; 
Bq. (9-60), us is see show that wu; is a linear combination of y1, y2 snd voi a 
J, is 2 lin nite panera . ieee ga toa = —%. Since bu = 0, Eq. (9-36) shows that 
the y’s, as it ca ogee of y1, yz, and ys. The variable ws is a linear combinati 

ys, n be written as ination of 


Us =a, —kb —e =y3 — uw 
We see that the two methods are equivalent. 


I 0 deter mine the swit 1 2. p. y . 
g 
( ) chin; boundaries, U1, Ua, and U3 are sol V ed ex licitl in t 


dus du 
de + 2Fwn o + onus = (—2fonk + kf (9-71) 
fe ns a” 
Us = 25 (—BWouk +k") + Ac tent sin (Y + ont V1 = §) (9-72) 


where A and ¥ are constants of integration. 


d 
Us = lus + (& — D8) 


=~ by + onAartvet 008 (YW +b + wat V1 — §) (9-73) 
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where ¢ = sin! ¢. 
ua =a, —a@ = 1 — Us — kb 
= B — kft — Ae text sin (Y + ont V1 — @) (9-74) 


The constants A, B, y are determined by initial conditions. Equations (9-72) to 
(9-74) give the system trajectory in parametric form. From these equations, the 
switching boundaries can be determined by the reversed-time technique. 


9-9. Phase-space Technique Versus Programming. There are two 
ways of realizing optimum controls of high-order systems. 

1. Phase-space Technique. As described in previous sections, the sys- 
tem operates on error signal and its derivatives or equivalent variables. 
The system is a feedback system. In practical cases with finite gain, 
there usually exists a linear region near the origin. 

2. Programming. For controlled systems with real roots or complex 
roots of low enough natural frequency, the number of switchings (or 
sign reversals of f:) for an optimum system is n — 1. Let the time at 
the various switchings be denoted by ti, 7 = 1,2, ...,%— 1, with the 
origin of t coinciding with the entry of an ordered change c:. Each 4; if 
a function of cy. The controller hasn — 1 nonlinear elements represent» 
ing the n — 1 functions #,(¢1) and causes f to reverse sign at these pre= 
determined instants. This type of control is a feedforward system, 
Usually it is used in conjunction with a linear feedback system such that 
at the end of the operations the controlling function is reverted to the 
linear system. 

Figures 9-14a and b are functional diagrams illustrating the operations 
of the two types of systems. 

Functionally, the phase-space type of control is considerably super 
to the programming type. For the latter to operate well, the syste 
must be in a steady-state condition or “in trim” before entering 
ordered change c:. This implies also that, once a change ¢, is orde 
it must be carried out before another change is ordered. There is tit 
such limitation with the phase-space type of control. The system alwi 
carries out the best, or fastest, method of reaching the desired steady-#lt 
condition, irrespective of the initial conditions. Furthermore, in 
there is load disturbance, the phase-space type of control is also superior) 
Small and constantly present disturbances are counteracted by the lin 
mode of operation in both systems. Large isolated disturbing foros 
moments cause the representative point of the system to be dislod 
from the neighborhood of the origin, and the phase-space type of cont 
prescribes the fastest maneuver to restore it. However, as the ¢ 
caused by disturbance is usually not a simple step change, the prog 
ming type of control is totally ineffective. 

So far as instrumentation is concerned, the programming type in 
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pler for systems of third order or higher. It is much easier to assemble 
n “— nonlinear functions of a single variable than to assemble n — 1 
nonlinear surfaces and curves in phase space which are functions of several 
variables. The noise picked up by measuring instruments affects only 
the linear mode of a programming type of controller, whereas it affects 
both the linear and nonlinear operations of the phase-space type of con- 
troller. Consequently, noise filtering in the latter is much more of a 
problem, since the effects of the filters in the phase space are difficult to 
determine exactly. 


Phase-space Control 
variables, w ; Signal ae 


Summing 
| and Phase-space Controlled 
differentiating simulation system 
circuits dynamics 


(a) Controlled 


system 
Timing 
circuits 


dynamics 
Linear 


controller 


Storage 
circuits 


() 


"ta. 9-14. Two ways of optimum maneuverin 
control. - ‘ 
(6) programmed controller. : rane nore 


9-10. Preliminary Filtering for Predictor Control. There are two 
types of noise existing in a measuring instrument: sinusoidal interferences 
picked up from stray fields of power components and random noises from 
loss artificial sources. The sinusoidal interferences can be removed by 
carefully shielding the sensing elements or by using sharp antiresonant 
networks in the earlier amplification stages. The random noise is not so 
readily removable. It has a much wider bandwidth than the signal band- 
width and usually overlaps the latter. An optimum linear filter carefully 
designed by the techniques of Chap. 4 is called for, and the same filter is 
used for both the linear and nonlinear range of operations. 

lor proper nonlinear operation, time delay in the filters must be com- 
ponsated for by advancing the switching boundaries. Tigure 9-15 illus- 
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trates how this can be done for the second-order system described in See. 
9-5. Instead of using the two trajectories as the farther boundaries of 
the switching zone, as shown in Fig. 9-8, the switching zone is moved 
farther in. As the error signals are delayed in the noise filter by an 
interval approximately equal to the phase slope (d8/dw) of the filter at 
w = 0, the trajectory of the system projects slightly beyond the farther 
boundary of the switching zone, as shown in Fig. 9-15. 


. Saturation 
Noise 
: Phase- Controlled 
ore’ plane system 
o controller dynamics 


trajectory Actual switching zone 


Fig. 9-15. Phase-plane controller with linear filtering. (a) Block diagram; (b) mo 


fication of switching zone. 


9-11. Modified Phase-plane Control of a Third-order System. 
discussed in Sec. 9-9, the phase-space type of controller is more desiraly 
in operation, because it is a feedback system and is inherently m 
flexible. However, its cost is high because of the necessity of assim 
ing surfaces in space which are functions of several variables. One 
of realizing many of the desired features of phase-space control wit 


+ U.S. Patent 2,973,926, Control Apparatus, issued to Sheldon 8, L. Chang, M 
1961. 
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paying the cost is to use a combination system. The arrangement of the 
system is shown in Fig. 9-16. There are two essential control units. 

1. Stabilizing Unit. When the contactor is open, the unit admits only 
derivative signals ¢, é, or the equivalent, and switches f in such a way 
that é and é are reduced to zero in the least time by the phase-plane 
method. Once this is accomplished, the contactor closes. This unit is 
similar to the second-order system described in Sec. 9-5. 

2. Dislodging Unit. The dislodging unit consists of the nonlinear 
element 1 and the contactor. It is not effective when the contactor 
is open. When the contactor is closed, the controlled system is in a 
steady-state condition with a certain value of c. If ¢ is not equal to C1, 
a voltage which is usually a nonlinear function (nonlinear element 1) of 
the error c; — ¢ is introduced into the stabilizing unit and causes f,; to be 
at its maximum value in the direction of reducing c; — c for an interval 
depending on ¢; — c. At the end of this interval, the contactor opens, 


Contactor 
ey 


Stabilizing 
unit 
(phase-plane 
control) 


Fra. 9-16. Block diagram of modified phase-plane control of a third-order system. 


and the stabilizing unit takes over the control function. The nonlinear 
clement 1 is designed to hold the first step just long enough so that, after 
stabilization, c, — ¢ is zero or approximately zero. 

With the contactor closed, for small inputs and small load disturbances, 
the system acts like a stable linear system. 

An example of such a design is an experimental aircraft-pitch linear 
ucceleration controller for the problem described in Example 9-3. The 
input signals to the stabilizing unit are uw, and us. Two dimensionless 
variables are defined as 

WnU3 - U2 
Gk), "~ — ho, 


lquations (9-72) and (9-73) can be rewritten to give e; and e, in terms of 
(he dimensionless variable w,¢. These are the parametric equations of 
\he trajectories in phase space e; — e, and are plotted in Fig. 9-17. 
The solid and broken curves of Fig. 9-17 correspond to d6/dt = nm, —Qm, 
respectively, The two trajectories marked 1 and 2 pass the origin and 
form the optimum switching boundary, 


(9-75) 


ée = 


———— Te 
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The operation of the system can be explained with the help of Figs. 
9-16 and 9-17. In Fig. 9-16, the controlled system includes both the 
aircraft dynamics and the integration from elevator rate to elevator angle. 
When the contactor is closed and a positive input ¢: is applied to the 
system, the error voltage ¢: causes 5 to increase at the rate Qn. The 
representative point of the system follows curve 3 into the region of 
negative f. However, because of the overriding influence of the error 
voltage é1, 2 remains positive until 
the representative point reaches a 
far enough point a to reverse @2. 
At this point the contactor opens 
and es suddenly changes into a large 
negative voltage. Consequently f 
reverses sign and the representative 
point follows trajectory 6 to point 
b. At b, f reverses sign and the 
representative point follows 1 to the 
origin, near which point the con» 
tactor closes. There is a net gain 
in the output variable c. (In this 
case c represents the pitch linear 
acceleration a.) If the nonlinear 
element 1 is properly designed, the 


Frc. 9-17. Trajectories in modified phase 
plane. (Reprinted from Sheldon 8. L. 
Chang, Proc. Natl. Electronics Conf., vol. 
12, pp. 134-151, 1956.) 


analog computer. 
angle 6, pitch linear acceleration @ 
angle of attack a, and pitch angi 
velocity q are all expressed in per-unit quantities. In graph b the ordered 
change in a of two units corresponds approximately to the maximu 
possible a, because of structural limits. We note that the elevator ai 
6 always moves at a maximum rate and reverses twice during the wh 
process, as required by theory. There is little overshoot ina. However 
there is a slight dip in a when 6 begins to move upward at the start. 7 
is due to the nonminimum-phase effect described in Sec. 4-7. 
Figure 9-19 illustrates the system response to load disturbance of 1 
nature of a pitchwise torque impulse. With the control on, and 
impulse applied at ont = 0, the system disturbance is reduced to 
very quickly. In contrast, when an equal and opposite impulse is app 
at wat = 8, the control is off, and a persistent oscillation is observed, 
note that the controller serves to damp out later oscillations comple 
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Fra. 9-18. System response to step inputs. (Reprinted 
Proc. Natl. Electronics Conf., vol. 12, pp. 134-151, 1956.) ei ai 
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but is not very effective in suppressing the first lobe. This effect is 
rather characteristic of tail-controlled steering systems in which a rela- 
tively small surface is used to control a much larger body. The relatively 
small control force is adequate to produce desired change as it is magni- 
fied by the large number of integrations inherent in the system dynamics. 
However, as it takes time to integrate, the effect of control on the system 
is always felt some time later. ; ; 
This type of control can tolerate a fair amount of component iac- 


curacies. Figures 9-18 and 9-19 show system responses with a stabilizing 


2) 


12 


0.8 


-0.8 
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tea Se 


Fic. 9-20. Theoretical and actual switching boundaries in modified phase plane, 


(Reprinted from Sheldon S. L. Chang, Proc. Natl. Electronics Conf., vol. 12, pp. 134-151, 
1956.) 


unit having a switching zone as indicated by the broken lines of Fig. 9-20) 
In contrast, the theoretical switching boundary is shown as the solid lin 
As the performance indicates that a deviation of this order is tolerable 
the electronic design does not present a problem. 

9-12. Systems with Multiple Limits. It is not unusual for a controll 
system to have more than one saturation limit. At times this difficult 
can be removed by properly selecting the controlled variable. ‘Ta 
the previous example of aircraft-pitch control; there are at least the fo 
lowing saturation limits: 


1. Structural limit, which limits the linear acceleration a 
2. Angle of attack limit, which limits a 

3. Elevator angle limit 

4, levator rate limit 
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The choice of a as a controlled variable removes 1, as we may simply not 
order too large an a;. Since the elevator force is small, the angle of attack 
a is approximately proportional to a in its linear range. At low speed or 
high altitude or both, 2 is more of a restriction than 1, and we may simply 
not order an a; that requires too large an a. The linear or allowable 
range of 6 is usually large enough to cover the range of a or a in steady 
state. It does not present a problem if the value of 6 during the transient 
period is not higher than its final value. Figure 9-18 shows that this is 
the case with predictor control except for very small values of the ordered 
change a, in which case 6 is small also. Thus the only limit left is the 
elevator rate limit. 

Another possibility of saturation is in the electronics itself, owing to the 
presence of the quadrature component of the carrier, harmonics, or some 
other sources of sinusoidal noise. Such saturation can be removed by a 
rearrangement of the electronic components and is not essential to the 
system. For instance, sinusoidal noise, except the quadrature carrier, 
can be removed by a notch network in the preamplifier stages. The 
quadrature carrier can be removed to a large extent by careful balancing, 
and the remainder can be removed by moving ahead the detection stage. 
To accept a nonessential limit not only complicates the mathematics of 
the problem; it also makes the solution nonoptimal, since we can make 
a better system by simply redesigning the electronic circuitry. 

The above illustrates the necessity of examining a problem carefully 
before we come to the conclusion that the system has multiple saturation 
limits. In such a case, the solution is usually complicated, and we shall 
limit our study here to a relatively simple case: 

A highly idealized ship-steering problem can be formulated as follows:t 


rer _ Ka12) (9-76) 


where y is the ship heading and 4 is the rudder angle. The rudder-angle 
and rudder-rate limits can be expressed as 


]6(t)| < bm (9-77) 
fa | S% (9-78) 


‘The optimum solution is that at any time either one or the other of the 
following conditions must hold: 
[al Sn (9-79) 


= Abn (9-80) 
| This material is condensed from H. G. Doll and T. M. Stout, Design and Analog- 


computer Analysis of an Optimum Third-order Nonlinear Servomechanism, Trans. 
ASME, pp, 518-525, April, 1957. 


——————— ON 
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First, let us consider the optimum rudder movement for a desired step 
change in 7, say v1. At the desired steady-state condition, 

y¥=0 (9-81) 

y=0 (9-82) 

Let 0 and ¢, denote the initial time and final time, respectively. Equa- 

tions (9-81) and (9-82) require that 
6(t) = 0 fort > t1 (9-83) 
J, s@ a = 0 (9-84) 


Equations (9-79), (9-80), (9-83), and (9-84) give the 6 versus {, 7 versus 
i, and y1 — 7 versus é curves of Figs. 9-21 and 9-22. Figure 9-21 illus- 
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Fig, 9-21. Optimum controlled move- Fie. 9-22. Optimum controlled mo 


ments for small step inputs. ments for large step inputs (third-or 
system with two limits). 


ty 


i trates the situation that the ordered change in 71 is small, and con 
quently the rudder-angle limit has never been reached. Figure ()-' 
illustrates the case with large ordered change in yi. The dividing line 
; _ , 5m 

i = 4 o, (1) 


The net change in y can be obtained by integrating Eq. (9-76), i 
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equal to 46,,/2,, or less, 
_ KQqt 
1 = “307 - (9-86) 


The above simple solution can be readily generalized to include cases 
with ramp input by modifying Eq. (9-84). 

While the above direct solution is quite simple, the switching bound- 
aries in phase space are not so easily determined. First we shall write 
the system equations in terms of per-unit quantities: 


eae 
5m 
TQ? 
ae aon) 


With step and ramp inputs, Eq. (9-76) becomes 
de 6 


a Se alts (9-87) 
In Eq. (9-87), T represents 6/5,. Equations (9-77) and (9-78) become 
IT| <1 (9-88) 

aT | 1 
ae (9-89) 


Let the phase-space variables x, y, z be chosen as 


de de 
r=—e = _— = 

Ye or a ari 
l’rom Eqs. (9-87) and (9-88), we see that the phase trajectories are bound 
between two planes with zg = +1. On the upper plane, z = 1, and the 
phase trajectories are the set of parabolas 


r— 21 = Wy’ 
Similarly, on the lower plane, the phase trajectories are 
ein, oye 


In between the two planes, the phase trajectories are obtained by solv- 
ing the differential equation 


d’e 
ga eee 
The result is a set of parametric equations: 
t=e=mtyrt ores 
Pde r? 
eae em) + zor + 9 (9-90) 
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The switching boundaries can be obtained by the reversed-time tech- 
nique. Let 7, represent the required time to reach the origin. The 
final switching boundary is for 71 < 1: 


i 
P= 6 
£ 
y= iD (9-91) 
C= ET 


The switching surface or first switching boundary is obtained from Eqs. 
(9-90) and (9-91): 


a. ets T1°T2 7172” te 
wie (G a 9 ae 9 ;) 
2 2 
paren es) fm) 
2= +(1— 72) 


where 72 is the time required to reach a point on the final switching bound= 
ary of Eq. (9-91). An equation relating x, y, and z can be obtained by 
eliminating 7, and 7» from the set of Eqs. (9-92). For the upper signs 


Eqs. (9-92), 
qs. (9-92) . s : 
r= -2(y +3) - (F +1) (9-93 


For the lower signs of Eqs. (9-92), 


x= 2(v-$)+(§-) ( 


The switching surface can be represented by a set of constant z lines 
the zy plane, as shown in Fig. 9-23. The final switching boundary is 
broken curve HOF. 

A typical three-dimensional system trajectory is represented by 
broken curve ABDFO. Let us assume that the system starts at point 
with T = 0. Since this point lies above the switching surface, a posi 
movement of the rudder results, with d7'/dr = 1. The representatl 
point P moves in the —z direction while its projection on the xy p 
follows the curve AB. At point B, P reaches the lower plane 2 = = 
Thereafter 7’ = 1 and P follows the parabolic trajectory to D, where 
intersects with the switching surface. After D,dT/dr = —1 and [’ 
low the trajectory to F, which is a point on the final switching boun 
At F, dT/dr becomes 1, and P follows the final switching boundary to 
origin. A time representation of 6, é, and e is shown in Fig, 9-24, 
differs from Fig. 9-22 in its lack of symmetry, and the difference 
explained by the presence of a ramp input. 


(ime control of a syste i i i 
fried nein cae ae with simple saturation and deterministic inputs 
’ , otc,), is shown that the desired final state is reached in 
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Fra. 9-23. Projection of j i Switching surface on 2 ane 
, phase-space trajectories and itching f y p 
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Fira. 9-24 i 
1a. 9-24, Optimum controlled movements for mixed step and ramp inputs 


9-13, , . Pant, 
a Ct The maximum principle of Pontryagin for minimal- 
: ol o nonlinear systems is derived. While this broad principle 
'y have other applications, its immediate application is in the minimal- 
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the least time by operating the controlled system at its saturated values 
in either one direction or the other. If all the poles of a controlled system 
of nth order lie on the negative real axis including the origin, the desired 
final state is arrived at by reversing the input to the controlled system 
nm — 1 times. If there are oscillatory poles, the required number of 
reversals may be greater than n — 1. If the system is unstable uncon- 
trolled, the phase space is divided into convergent and divergent regions, 
and the above observation applies only to the convergent region. 

A controller that realizes the above principle by simulating optimum 
switching boundaries in phase space is called a predictor type of controller. 
The switching boundary of a second-order system can be simulated by 
conventional diode shapers and amplifiers. While an ideal system with an 
infinitely thin switching boundary requires an amplifier of infinite gain, 
the use of finite-gain amplifiers does not slow down the system to any 
appreciable extent. In fact, the use of amplifiers with finite gain has an 
advantage in itself: Once the desired final state is approximately realized, 
the system automatically becomes a linear feedback control system with 
lead or proportional plus derivative compensation. 

The simulation of switching boundaries of a system of third order or 
higher is a serious hardware problem. However, the problem can be 
alleviated by using a modified version of phase-plane control, which 
reduces the error derivatives to zero by the predictor technique, and a dis 
lodging unit which operates the system in one extreme value just long 
enough so that, after subsequent stabilization, the desired change in the 
controlled variable is realized. 

If the transfer function of the controlled system has a numerator 
polynomial in s other than a constant, the representative point skips to # 
new position in phase space at each switching. This effect can be elimix 
nated by using a new set of phase-space coordinates which are signal 
derived by mixing the error derivatives with the input or actuating signal 
to the controlled system. 

In the presence of noise, prefiltering of the error signal prior to predi¢ 
control is necessary. The switching boundaries should be advanced | 
position to compensate for the filter time delay. 

The problem of multiple saturation limits can usually be avoided by 
careful examination of the control problem to separate the essential limi 
from the nonessential limits. However, there are systems in whie 
multiple saturation limits are unavoidable. The problem has not | 
solved except in the simplest cases, an example of which is given. 


CHAPTER 10 


SELF-OPTIMIZING SYSTEMS 


10-1. imizi 
on Naat olsen nese tosireti 4 cto <8 In the previous chapters, 
3 ned mainly with the synthesis of 
which both the plant d i i Spar pie 
i ynamics and the statistical pro i i 
perties of the i 
pee bc The optimum performance as well as the ences 
or realizing it can then be determined. However, in man 
} y 


applications, the statistics of the inputs and the plant dynamics are not 


completely known or are gradually changing. It becomes necessary to 
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Fria. 10-1. Basic elements of a self-optimizing system. 
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either open- or closed-loop, depending on whether the measured quan- 
tities are affected by the subsequent adjustments or not. 

Functionally speaking, self-optimizing systems can be classified into 
three different types: 

1. Input sensing. The statistical properties of the inputs are uncer- 
tain. The adaptive controller measures these properties and auto- 
matically makes adjustments on the system controller. In order to 
make this type of system function, the designer has to know in advance 
the optimum adjustments for each set of measured data. The adaptive 
control is usually open-loop. However, it becomes closed-loop if some 
variable of the system is measured instead of measuring the inputs 
directly. 

2. Plant sensing. Some parameters of the plant dynamics are uncer- 
tain. These parameters are measured, and the system controller is 
adjusted to match the parameters of the plant. There are three ways of 
estimating the plant parameters: 
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a. Measuring the input and output variables of the plant 
b. Measuring the input and the output variables of the closed-loop 
system 
-¢. Indirect estimation of the plant parameters by measuring some other 
physical entity, such as the forward speed of a ship in a steering system 


In (a) and (b) the input variables may be natural inputs to the system 
or may be a test function, depending on the type of application. I 
systems where stochastic inputs are constantly present, existing inputs 
are usually adequate. In other systems where the natural stochastié 
inputs are negligible and our problem is to hold the system in constay 
readiness for occasional command, test functions are usually used. 
computing device for determining the system parameters from inpu 
output measurements is simpler when a test function is used. 

The adaptive control is open-loop for cases @ and c and closed-loop f 
case b. 

3. Extremal sensing of certain plant performance criteria. In 
type of system the adaptive controller measures directly certain 
formance merit criteria (e.g., mean-square error or cost function of a 8@ 
system) and adjusts the system controller to seek a maximum or minim 
of this quantity. The adaptive control is closed-loop. It differs f 
type 2b in that the maximum or minimum value of the given performa 
criterion is usually unknown, whereas in type 2b the desired setting 
the controlled parameter is usually known and is not at its extreme Vv 
(An example is the damping coefficient of a closed-loop system, wi 
desired value somewhere in the range of 0.6 to 0.9.) 
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In the present chapter we are concerned with the basic principle of 
adaptive control systems. Examples are given to help explain how the 
systems work and how to design these systems in situations that are not 


too critical. The next chapter is devoted to analytical studies of the two 
essential functions characterizing a self-optimizing system: 


1. Learning or measuring 
2. Adjusting 


It also discusses the best way of ishi 
y of accomplishing th 
varying situations. “ abil iia 
§ ay Se with Input Sensing. Input-sensing systems may be 
ie oe for two purposes: (1) optimum suppression of noise and (2) 
4 imum allocation of plant power. We shall study the first case first. 
iven spectral densities ®,,(jw) and ®nn(jw), the optimum closed-loop 


system function is 
Crono Oat) 
B®) = Zs) Eek 


5] / d Cc d 


(10-1) 


Example 10-1 


Plant transfer function: G(s) = as 
8 
Signal spectral density: ®,, = s 
f @ 
Noise spectral density: Pnn = B? 
Py, = Brn = 


The signal-to-noise ratio A*/B? may vary slowly. 
Solution. Equation (4-27a) gives 


2 == + Bt = (B +4) (2-4) 


Therefore Z = B + A/s, and Eq. (10-1) gives 


SRS AC 
R A+Bs~ 1 + (B/A)s ee 
The form of Eq. (10-2) suggest i i 
och air Boa : uggests that a straight amplifier Ka would do. The closed- 
f@) =;72 
EY “Te, a 


Comparing Eqs. (10-2) and (10-3), we see that 


1A 
RB (10-4) 


v 
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Since the signal-to-noise amplitude ratio A/B is known by measurement, Ka is 
adjusted proportionately. The complete system is shown in Fig. 10-2. 
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The value of B is unknown or uncertain. Because of heat loss or saturation, the input 


to the plant is limited: 


[ egal Hessen 
| 

| ! 

k Computing 

! 
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| 
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r(t)+n(t) 


Fra. 10-2. An adaptive system for optimum suppression of noise. 


The above example illustrates an extremely simple case for which & 
single gain control is adequate. However, in general, coordinated 
changes of a few parameters such as gain, feedback constant, and network — 
time constants are necessary. The design principle is nevertheless the 
same. 

10-3. Systems with Controlled Plant Activity. As discussed in Chap, 
4, when load disturbance is present, plant saturation or heat loss is usually 
a major consideration in system design. The problem is usually up 10 
how high a frequency range we wish the controller to counteract the 
effects of load disturbance. Beyond this range the disturbances 
left to the natural attenuation of plant inertia. When the magnitude 
load disturbance is high, we compensate only for the frequency rat 
where the controlled system is most sensitive in its response. 
compensated range is increased as the magnitude of load disturban 
lessens. 

The above design concept can be carried out by either open-loop 
closed-loop adaptive control. The latter is usually preferable. T 
following examples illustrate how to design and instrument the two ty 
of systems. 


Example 10-2. Known data are: 


Plant transfer function: G(s) = Ss 

: ; A? 
Signal spectral density: Mee} 
Load torque disturbance: br, = Bt 


7 = Or = 0 


2<M (10-5) 


Solution. ; To determine the optimum system, the load torque disturbance is con- 
verted first into a positional disturbance: 
B 
ial", 


Equations (4-11), (4-31), and (4-32) give the optimum system functions 


C 2 
= (8) = Fi(s) = ———-+—___ i 
m R 2+ V208 + 0 so: 
§ (e) = 1+ Fal) = > —_ t 
D s? + V2 018 + 0: 
ie ey Ls 
where oo = NZ (10-8) 
and k is Lagrange’s multiplier. 
To determine k or w; in terms of B, we have 
5 se cre! = = 
B= 5 | "GG Pibite + Fababas) ds 
i 1 i —A2J%1432 + B2(w14 — 2w12s?) 
fig J oka K%6* + 014) - (10-8) 


Equation (10-9) can be readily evaluated by contour integration, as shown in 
Appendix D. The result is 
i as A2*J%913 + 3B%w, 


1? = = ie 
2/2 K? 


(10-10) 
Equation (10-10) gives w as a function of B. 

The block diagram of the system is shown in Fig. 10-3. In order to obtain optimum 
control, simultaneous adjustment of the amplifier gain K, and feedback constant Ky 


[ieee Jp — 
Y | 
A 
| 
Load 
Y disturbance 


Mia, 1063. An adaptive system for varying load disturbance, 


Te 
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is necessary : From Eqs. (10-10) and (10-11), # can be expressed as a function of Kz, A2, and B?: 
a) ? . 


2 
a ow (10-11) J \¥ 9 \% 
a (sz) KaiA? + (Ge K,¥B? 
x, -¥2 (10-12) a Pine oi 
E ere 
If variations in A? and B? are slow, the system is expected to track very well. Change 


sible to adjust K. alone or K, alone and obtain an optimum setting in in Z is expected to be small, and linearization can be used: 


While it is pos 
the result is not as good as that obtained 


the sense of Phillips’s procedure (Ref. J1), 


here from Wiener’s procedure. 
Example 10-3. Another way of obtaining substantially the same result as Fig. 


10-3 is shown in Fig. 10-4. The function [i(¢)]? is measured directly and compared 


6= = 


ot 0% 0 
aK, Ke + 9 q2 04? + app OB? (10-15) 


The partial derivatives are calculated from Eq. (10-14). Figure 10-5 represents a 
type 1 system. We note that, under the assumption that variations in A? and B? 
are slow, a low gain Ka can be used, and the adaptive loop is stable. If variations in 
A’* and B? are fast, the filter and Ka determine the effectiveness of the adaptive con 
trol loop. The problem requires further study (Sec. 11-10). “ ie 


Desired level of 
plant activity M 
{ 


Equivalent 
noise 
tot 
Auxiliary servo 


Mia. 10-5. Equi i % 
Wig. 10-4. quivalent block diagram for the adaptive control loop of the system of 


10-4. Systems with Plant Sensing by the Use of a Test Signal. If the 
plant parameters vary with a certain readily measurable quantity in a 
predetermined manner, the simplest adaptive control is obtained by 
changing the system controller according to the measurable quantity. 
In steering systems, practically every constant of the dynamical equations 
silage ail dopane on the forward speed, and the system controller is usually 

y= Es adjusted according to the forward speed. An engineering problem arises 
however, when the variations in plant parameter are not povduiacnins 
: he emphasis of the present section will be on cases a and b mentioned its 
Sec. 10-1, where the variations in plant parameter have to be measured 
directly. The measuring techniques for both cases are the same. How- 
over, as case b has a closed adaptive loop, an analysis of the adaptive loop 
is also necessary. We shall discuss the measuring techniques first. 


Fiaq, 10-4. An adaptive system with automatic input-level control. 


with the set limit M. ‘The difference is filtered to average out the random varia 
of [i(t)|2. The gain K. is adjusted upward at a rate proportional to M = |i(¢ 
while Ky is varied simultaneously so that at all times 


The adaptive control is closed-loop and is illustrated in Fig. 10-5. The varinh! 
here represents [i(¢)]?. The controlled variable is the expected value 2, and 
random fluctuation « — Z is treated as noise. Variations in input and load cis 
ance levels A and B are treated as load disturbance in the adaptive loop. 
only variations in Kq are shown explicitly in Fig, 10-5, the feedback constant 


understood to vary according to Iq. (10-18), 


——— EO “"— 
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One thing is common among plant-sensing self-optimizing systems 
that have proved practical or at least seemingly so in the breadboard 
stage: The form of the transfer function or the pattern of the poles and 
zeros is assumed known, and only one or two critical parameters are 
to be measured. This is more or less what one would expect. If we 
could make an adaptive controller that assumes nothing on the controlled 
plant, makes all the necessary measurements, adjusts itself accordingly to 
give satisfactory control performance, and does this sufficiently fast to 
track any change in the environment or plant dynamics, and with suffi- 
cient economy to be practical, we should have no further use for engineer- 


ing departments. 


Presently we shall not try to build a universal adaptive controller but 
shall limit ourselves to objectives which are much less ambitious. The 
measured quantities are usually the damping factor and natural frequency 
of the system, either open- or closed-loop, and the general form of the 
plant transfer function is assumed to be known. With this limited 
objective, and using a test signal, the measurements can be quite simple, 
In general, the test signal does not have to be applied at the input nor the 
measurements made at the output. These should be done at places most 
indicative of the response characteristics that we wish to determine while 
causing least disturbance to the system. Classified according to the 


nature of the test signal, there are three types of measurements. 


1. Frequency-response Method. If we know definitely the form of the 
transfer function and also that the so-called minor time constants are 
so small that their effects are totally negligible, gain and phase measure 
ment at a single frequency somewhat higher than cutoff is quite adequate 
to determine the system’s response parameters. For instance, consider 


a second-order system: 


Wn? 


82 + 2Wuns + wn” 


5) = 


The parameters ¢ and wn can be easily determined from the magnitude 


and phase of (C/R) (je) for a single w. The reasons for choosing @ > 


are that (1) the regular signal level is low for w > w, and consequently 
weak test signal is all that is necessary and (2), other things remaitil 
equal, the time required for determining gain and phase is inverse 


proportional to w. 


2. Impulse-response M ethod. One way of determining & syslom 


stability is to examine its response to a test impulse. Of course, the 
signal is not an ideal impulse but one with finite pulse width and 
magnitude. As long as the pulse width is considerably narrower than 
reciprocal of the system’s bandwidth, the system’s response approxi 
impulse response closely, 


(10-10) 
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If the response characteristics of interest are not appreciably masked 
by other servo signals in the system, adjustments can be Ai nats 
basis of each individual impulse response. Otherwise, averagin pas 
the responses to many periodically applied impulses seve be Pasa 

Be Correlation Method. If the input to a system is white noise Hie 
correlation function dir(r) between input z(¢) and response v(E) is + : 
tional to the impulse-response functon h(7). In practice, it is onl re xi 
sary for 7(t) to have nearly constant spectral density 2 the telat 


ee in which H(jw) is appreciably different from zero. 
ince 


®i,(jo) = O;(jw)H (jw) = const X A(jw) 


an inverse transform of the above equation gives the desired conclusion 
The determination of the impulse-response function by correlation as 
originally proposed by Y. W. Lee (L3). In a recent paper (A1), Ander- 
son, Aseltine, Mancini, and Sarture proposed a method which a fairl 
easy to instrument. The test signal 7(¢) has amplitude + J, where J ie 
constant, and switches from plus to minus, and vice eres at random 
intervals. The average rate of switching is many times highat than the 
servo bandwidth and consequently the spectral density (jw) is approxi- 
mately constant over a few times the servo bandwidth ‘ The ate i 
instrumentation is seen from the following equation: a 


Piv(T) meas ~ Ut — T)v(L) = v(t) X sign of a(t — r)I 


Thus a switching circuit triggered by i(¢ — 7) can be used instead of 
multiplier in obtaining the product 7(¢ — 7)v(t). The time averagin i 
done by either a low-pass filter or integration, as discussed in Sec 7A "In 
© mete, proposed by Anderson, Aseltine, Mancini, and arti, a sek of 
( me signals u(t — tr), m = 1,2, ... , 12, is recorded on a multichan- 
ne magnetic tape. By applying these signals to trigger different switch- 
ing circuits, ¢(7) with 12 different 7’s can be obtained simultaneousl 
Once h(r) or the equivalent H(jw) is determined, there remains the 


question: How do we make use of this information in adjusting the con- 


troller of the main servo system? Usually it is necessary to compute 


from h(r) or H(jw) the values of some key parameters that we wish t 
control. One possible criterion is the area ratio suggested by eee 
et al. It is defined as A,/A_, where A, and A_ are the total positive 
and negative areas, respectively, under the h(r) curve. Since h(n) is th 
lime derivative of unit step response, A, measures the total u nly 
movement of the unit step response curve, while A_ measures Peat 1 
downward movement. The area ratio is a good indication of the o ' 
shoot or oscillatory tendency of the system. Once we decide ere sath 
(nin area ratio & = A,/A_, we can use as the adjustment signal a ‘Agure 
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of merit” F, defined by 
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F, = Ay — fA- 


A positive F, indicates that the system is overdamped, and the reverse is 
true if F, is negative. F. can be easily determined from h(r,) by diodes 
and summing circuits. 

Each of the three methods mentioned above has some comparative 
merits, depending on where applied. The frequency-response method is 
simplest to use but is limited by the requirement that one must possess 
more knowledge about the controlled system. For instance, the exist- 
ence of a minor time constant 7 with 1/T a few times higher than the 
servo crossover frequency , can change (C/R)(jw) very substantially, 
since w of the test signal is larger than w-. 

The impulse-response method is also simple to use, but one must be cer= 
tain that the desired response characteristics are not masked by the pres= 
ence of other servo signals. We shall see this in an example later. 

The correlation method is more generally applicable, and both the 
correlation method and frequency-response method are relatively immune 
to the presence of other servo signals. However, the correlation method 
introduces a “noise” of its own into the adaptive control loop because 
over any finite interval ®i(jw) cannot be constant on account of the 
randomness of i(t). These points will be discussed quantitatively in 
Sec. 11-4. 

Of course we can also determine the system response from measu 
ments made on the servo signals already present in the system witho 
the introduction of a test signal. We gain by not causing any distu 
ance. However, as the presence of such signals cannot be depended u 
all the time, the measurements have to be made over a longer perio 
A comparison between the two alternatives will be given in Sec. 11-4, 

10-5. Stability Analysis of the Adaptive Loop. In some applicatio 
the change in system dynamics is relatively slow, and it is possible 
detect such change quickly and accurately. Our problem then is 
design a simple and stable adaptive loop. 

If there is only one closed-loop performance parameter u to be m 
ured, and correspondingly only one adjustable parameter x, the mit 
servo loop can be represented as giving a value of u for each value of 
and y: 

u = u(x,y) 
where y stands for one or more variable parameters of the plant tra 
function. Let uo be the desired value of u. One simple way is to ad) 
x at a rate proportional to w) — u: 


« = K(uy — u) 
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To analyze the system’s stability, let us assume that, after some initial 


change, y remains constant thereafter. Then 


d 
ai le ~ u)? = —2(uo — u) =— 


0 
~2K = (uo — u)? 


We note that (uo — u)* decays exponentially as long as K and du/dz are 
of the same sign. We come to the following conclusion: An adaptive 
control loop with a rate of adjustment proportional to the measured oe 
in the performance parameter is always stable if (1) the time delay in Metis 
urement is negligible and (2) the performance parameter is a monotont 
function of the adjustable parameter. ya 
Sometimes it is desirable to control more than one response parameter 
of a system, and we are likely to find that the adaptive control loops are 
not. independent. While the main servo system has a single controlled 
variable, the adaptive controller has a number of interdependent rank 
bles. It is possible, of course, to separate the various adaptive loops 
physically at the cost of a computer of varying degrees of peed eet) 
We: on heaped But such a computer is usually anheaee 
8 ince m i ili 
ne! : tiara Ag adaptive control loop can be stabilized by some 
To analyze the situation, we have in general a set of adjustable param- 
eters Xi, 2, . . . , tn; a set of variable system parameters y;, y 
Ym; and a set of response parameters wu, U2, ... , U ee 
system gives u’s as functions of x’s and y’s. eid 


ne: 2 
The main servo 
UE UNE cd ohio 


- Un5Y1, - + Yky s+ 


lor any given set of values of y’s (Y1a,Y2a, . 5Yma) Which may 
occur under actual operating conditions, there is a set of values of 2’s 


a i 
ne - + + ,€na) such that the desired response characteristics can be 
net: 


~ Ym) 


Ube. = Us0y se i= 


dst (10-18) 


We note that the existence of a set 210, 222, . » tna, to satisfy Eq 
(10-18) for any possible set y1a, Yoa, » - - , Yma is simply shicthae way of 
auying that we must provide an adequate range of adjustment so that 
the desired operating point is not out of range. As an example, consider 
the second-order system shown in Fig. 10-6. Its closed-loop ceeniiia is 
niven by Eq. (10-16); and both ¢ and w, are to be controlled. This is 
done by introducing tachometer feedback Ky,s, as shown in Fig. 10-6 and 
adjusting the loop gain pot K, and feedback constant Ky. In terms of 
the more general terminology, K, and Ky, are the adjustable parameters 
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x, and t. The constants K., Kr, J, and 7 are variable system oe 

lie Yr» Y% Ys, and Ys; On and & are the response cree amas a pie: 7 

Obviously, the adjustable ranges of K, and Ky; must onee ai _ a 
ield desired values of wno and {0 for whatever combina ion of val 

K. Kr, J, and » may occur in practice; otherwise, the realization o 
a) ’ ’ 


is not possible. en 
ate wh of values yi, Y2, - + » Ym vary gradually with time, eo 
Bie az, Which satisfy Eq. (10-18) also vary with time. Let the 
Ya cea 


Reference phase lag 


| 
eb PRASCY Wie tye oe ee 
ian = a eee Te a eien ial y 


Phase} lag 


| 

| 

t ! 

| O.--- 

: ‘ s Se Lee 

te-[ewe fey f & 

h iy 

| | 
Bandpass 
filter 


| 
A 
| 


—» —-e—— — >—— 


e(t) 


Fra. 10-6. Block diagram of a servo system with controlled ¢ and wn. 


i troller 4 
latter set be denoted 210, 220, ..- +, no. The sor ae ~ 
i that the actual parameters 21, 2, . . + +n 
Lise ny tno. A block diagram of the adaptive loop is shown 
10; 20) - + + 9 n0- 
ig. 10-7, where 
o €: = Lin — Xi Ge 2M ene aie 
ai = Un — Vi Go ee eee 


The auxiliary servo is usually linear and can be expressed by 
Laplace transform equations: 


X,(s) = Y Hisls) Ai(s) fad. ae 


SELF-OPTIMIZING SYSTEMS 267 


For small errors, the relation between a; and e; can be linearized: 


a; = ) Bye; (10-22) 
7 
OU; 
her B,; = 2% 
where j ss 


The partial derivatives are evaluated for the present values 

. » Ym and corresponding values of X10, L209, - »%o. We note that 
aS Y1, Y2, - » « » Ym are functions of time, the coefficients B,; are also func- 
tions of time, and an exact analysis can be difficult. However, we can 
always approximate the y’s by a series of step changes, and within each 
step, the y’s can be treated as constants. If the adaptive system behaves 
satisfactorily for all possible values of y, it behaves satisfactorily for the 
step approximation of y’s, which is not substantially different from the 


of yi, Y2, 


%19 Ui ar i 
%29 £2 a 73 
H H i ; 
H H j Error-_ ! H 
: e Main servo | indicating a x. 

and parameters P Auxiliary zs 

measuring servo 
system 


lia. 10-7. Generalized block diagram of a system with multiple controlled parameters. 
actual situation. On the other hand, if the adaptive system is unstable 
or unsatisfactory in some other way for a possible y set, the adaptive 
system needs redesigning, because the y’s may take on that set of values 
for some length of time. 


The system response can be readily analyzed in matrix notation. Let 
X, Xo, A, and E denote column matrices 


Xi(s) 
Kio) = | B20) 


Xn(s) 


and H(s) and B denote rectangular matrices with H,;(s) and Bi; as their 
respective elements. Equations (10-19), (10-21), and (10-22) can be 
written as : 
E(s) = Xo(s) — X(s) (10-23) 
X(s) = H(s)A(s) (10-24) 
A(s) = BE(s) (10-25) 
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From the above equations, X(s) can be solved: 

X(s) = H(s)BE(s) = H(s)B[Ko(s) — X(s)] 

X(s) = [1 + H(s)B]-'H(s)BXo(s) (10-26) 


For arbitrary nonsingular matrices U and V, we have U-'V = (V-"U) = 
Using this relation, Eq. (10-26) can be written as 


X(s) = {{H(s)B-? + 1} *Ko(s) (10-27) 


Let T;,(s) denote the matrix element of I'(s) = [H(s)B]-* + 1, T(8) 
denote the cofactor of T;(s), and A(s) denote the determinant. The 
response X;,(s) due to an input Xjo(s) can be written as 

Pj.(s) 


Xi(s):= Als) X jo(8) 


Therefore, 


The poles of T'yi(s) are canceled by the poles of A(s). The closed-loo 
poles are simply roots of A(s). If all the roots of A(s) are in the LHP, the 
system is stable. 


Example 10-4. The above general analysis will be applied to the system ill 
trated in Fig. 10-6. Instead of measuring w, and ¢ directly, the gain and phase § 
of the system at some @; > ono are measured. It is assumed that either there is ap 
ciable random noise power at this frequency or, alternatively, a test signal of 
quency «1 is applied to the input. Two identical filters and amplitude detectors 
used at the input and output ends, and K» is set at the theoretical gain at wi: 


2 2 Ata? | 4 
m= 6 =[(%-1) + Hest] 
no @n0 


Let I; denote the average power of the filtered signal at the input end; then the 
error signal is 


a(t) = 11(K2 — @) a 


where G@ is the magnitude of the actual (C/R)(jw1) of the main servo system, 
error signal a;(t) is applied to an auxiliary servo which adjusts K, of the gain 
pot upward at a rate equal to K3a1. The phase lag ¢ of (C/R) (jo) is measured 
phase meter and compared with the theoretical value of ¢o = 7 — tan™*[2 ow 
(wi? — w%9)]. The phase error signal is applied to a second auxiliary servo to (i 
upward at the rate K,(oo — $). 

In terms of the general analysis, @ and ¢ take the places of w and us; Ay and 
take the places of a1 and 22. From Fig. 10-6, 


Ci ) = rset s Ugh ge 
RS) = TH 2ys/on + 8*/on? 
_ KikrKa 


where Wn J 
on Ky ” 
r= S(t KEK.) 
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From Eq. (10-31), 
10g 0 = — Mog [(1 — 28)? 4 AE" 
glee} (1-25) + ae: | (10-34) 
) =f — tan _ 2fer/on 
athad = & (10-35) 
From Kgs. (10-32) to (10-35) B:; can be evaluated: 
i is 0G _ GE OG dwn 
0K, 06 0Ki | d0,dKy 
a Gq? [ w? 
mai (3 = 1) + 4c] (10-36) 
By = OF = _ 2Gifur? 
3K ay (10-37) 
Bey « 28 cs, Me 
aK, ert, (10-38) 
Bog es oe i 
oh a 1) we 
: (10-39) 
The matrix H(s) is given by 
Hy(s) = BE (10-40) 
F112(s) = H21(s) =0 
(10-41) 
Ho(s) = — Ks 
‘ (10-42) 


It is assum i , 
ate a se bal eg ae constants introduced by the bandpass filter and detection 
aps nieega Brees time constants are of the order of a few times 1 ye and 
pxassighes tate e time constants of the adaptive servo. Asan ee 

7 @1 = 2wno, and ¢ = 0.707. The various constants are a 


Kz = @ = —.. = 0.24 
+ : 
¢o = 136.7° 
0.286 0.0403. 
a Ky Ki 
_ 0.166 0.17601 (10-43) 
Ky Ky 
H(s)B = z ‘totems —0.0403011Ks 
18 \0.166K,4 0.17601K, (10-44) 
3.08 0.706 
HB Kj “She Ks 
_ tot. 60 (10-45) 
wliK3 wik, 
A(s) =1 + GF a) 17.5K 7s? 
: Te eee Tacs a (10-46) 


lhe adaptive system obv ously stab eC m er W t the Fo 
| 18 10 ] 1 *, no atter ha 0 str, 8 
> con aints are, : 


any step cl i : i 
p changes in J, », Ky, and Ky, which result in step changes 6, and 69 for the 


tieoretienl : ‘ i 
values of Ky and Ky, respectively, the transient variations of Ky and Ky 
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are given by the following equations: 


& a + 5Kys/oik4 a =< 0.706K 162 

(8Ki)(s) = ACs) e K.A(s) (10-47) 
> a 1 + 3.08K1s/l1Ks 5s 2.916151 

(6Kys)(s) = A(s) eee TS (10-48) 


-hand side of the above equations is understood to be the 


The value of K, on the right 
for the present values of J,n, Kr, and Ka. 


one value that yields wno and fo 
are made on J, n, Kr, and Ka 
the adaptive control is open= 
paper, it is much more 
tions specified by Eqs. 


If, in Example 10-4, the measurements 
instead of the closed-loop response at 1, 
loop. While such a system appears simpler on 
difficult to instrument than the simple integra 
(10-40) to (10-42) of Example 10-4. 


10-6. Systems Using a Model. Sometimes the dynamics of the con» 
trolled system varies too fast for accurate measurement at servo signal 
and we can make the system’s response insensitive to such 


frequencies, 
tion with adaptive control. 


variations by the use of a model in conjunc 


r(t) 


Fra. 10-8. Block diagrams showing conditional feedback (a) and its equivalent 

The original idea of using a model was due to Lang and Ham (1), 
Fig. 10-8a, H(s) represents the transfer function of the measuring inal 
ments and is approximately 1 at frequencies for which M (jw) is app 
bly different from zero, and G(s) is a high-gain element. 
that any difference between C(s) and R(s)M(s) is negatively fed 
through G,(s), thus foreing the system’s response to R(s)M(s), A 
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is obtained from the equivalent block diagram of Fi 
a ere ae ie gain in G2(s), the system’s response is ans 
=> M(s), si i igi i 
ei , Since 1/G2(s) is negligible and H(s) is close to 
Pc caper there is no noise in the measuring instrument, no load 
ance, and no nonminimum phase or tran i : 
tation lag in G 
we can introduce a sufficient number in HS Lee 
of zeros in H(s)G 
the poles of G,(s) to these zer ee 
os and make the gain of G 1 
the system’s response to R(s) i Sch ie ee 
is completely dominated by M 
Vy 8). How- 
—_ kg dsia of the presence of the aforementioned effects a Wie 
not always feasible. Fleischer developed an optimum-linear-design 


Variable- gain 
SL amplifier 


Servo 
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Aircraft 
dynamics 


Excitation 
pes Switching |__| sharpener 
omic logic and 
‘ontro ; 
counter 


lia. 10-9. Essential com i 
y ; ponents of an a -pi i 
model and adaptive control of closed-loop eine yn ee ae 


ears eee the best compromise in the least-square-error sense 
system’s sensitivity to plant variati 
wforementioned effects on the oth Peiliniae came 
er. Because of space limitati 
not treat the subject h i mca eiitics 
ee j ere but refer interested readers to Fleischer’s 
A ; é : 
Rawr es is veil the gain of G2(s) so that the system is 
just scillate at high frequency. There ar i 
ious ways of doing this, as i i sy iene 
, as published in the Proceed i 
“inthis hae Beg ceedings of the Self-adaptive 
ymposium, January, 1959, at Wright Ai 
ment Center. Among these, 8. S. O 01) de epeeebene 
, 5. S. Osder (O1) described i 
mk i si escribed a method wh 
Ral si eee of generality and will be discussed here in its nie 
gure 10-9 is a block diagram representing the system. The oon 


p t, Fleische i i y Vv 
| ‘aul | E leischer, Optimum Design of Passi e-Adapti e, Linear Feedback iS) 
y ySs- 


toma wi Va i VOUS. 1D er - ) 3 c ontro 
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i rface 
trolled variable is the pitch angle 6, and 6 “A pr eg See Pe a 
: : ° 
i nt. Without counting the mode ' ; : 
oan K, G,(s), Ga(s), and H(s). Its cig ppt . ee 
The pit ics of a rigid aircra 
i - he pitch-control dynamics 0 
ius . ip ; ”. and P, and a zero at 21. dn Osder’s pant ae 
a ea by a pair of poles and a zero near the origin. ae re abs Pe 
if pi take into consideration the difference in vertical heig : a 
bs r of weight and center of wing force in the errenens ee a 
Ho feck with the loop gain that we are considering, one of the p 
ow ; 
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Fre. 10-10. Root loci of aircraft-pitch control system. 


vl 
ar the zero, and the result is practically the same as a pole Po ab 


os Aa e to a second-order servo act 


origin only.) The poles Ps and P4 Pe 
i i 8). 

tor, and z» is due to a lead term in 

ee but not so large as to cause the pers to be 
which are far away 

eee The system’s response from input to 0 is he 

However, such a critical gain value is Sr 

1 since the aircraft’s surface eff ectiv 

cause of wide variations in speed 


rigin and 


zero pairs at z; and 22. 
mately M’(s)/H(s). 
maintain without adaptive contro 
can vary as much as 10:1 or more, be 
altitude. : 

There are also large v 
may argue for canceling out the servo poles by a | 


) > 
ariations in the positions of / and Py — 
ead network and 


SELF-OPTIMIZING SYSTEMS 273 


the loop gain high enough so that, even with the least surface effective- 
ness, the aircraft and network poles are either very near a zero or far away 
from the origin. However, at high speed and low altitude, the loop gain 
becomes too high, and other minor time constants such as gyro poles 
and poles due to elastic aircraft structure are liable to show up and cause 
the system to be unstable. One cannot expect to introduce ‘sufficient 
lead networks to cancel out all these poles without leaving the system wide 
open to jamming by instrument noise and load disturbance. 
Osder’s adaptive control consists of the following: 


1. Applying a pulse of narrow width periodically at the input of the 
variable gain amplifier 

2. Counting the output oscillations of the servo actuator with ampli- 
tudes exceeding a preset threshold level by a pulse counter 


3. Adjusting the gain at a rate according to the count following each 
pulse 


The circuitry gives a slow decrease in gain at a count of three, a rela- 
tively fast decrease at a count of four, and a fast decrease at a count of 
five. The gain is increased if two successive counts of less than two are 
registered. Thus his logical circuitry tends to keep the count of sig- 
nificant oscillations at two. As shown in Fig. 10-10, the two high-fre- 
quency oscillatory poles are kept approximately at q: and qo. 

In summary, the use of a model or inner feedback loop in a self-opti- 
mizing system has two distinct advantages: 

1. The presence of an inner loop or model automatically makes the 
system’s response less sensitive to plant variations. 

2. It elevates the critical frequency at which the system’s response is 
measured. The relatively low power content of servo signals at these 
high frequencies and the fact that measurement is made at high frequency 
reduce the time required for accurate measurement by at least an order 
of magnitude. Thus the system is capable of tracking relatively fast 
variations in controlled system dynamics. 

10-7. Optimalizing Control Methods. The “optimalizing” control 
inethods originated by Draper and Li are perhaps the earliest work on 
extremal sensing systems. These methods are easy to instrument and 
are effective in the slow tracking situation. They are: 


Sensitivity-signal-input optimalizing controllers 
Continuous-test-signal optimalizing controllers 
5. Output-sampling optimalizing controllers 

4. Peak-holding optimalizing controllers 


I. 
» 
a 


! This material is condensed from ©, §, Draper and Y, T. Li, “Principles of Opti- 
Moalizing Control Systems and an Application to the Internal Combustion engine” 
(D7), The terminology is modified to be in line with the rest of the book, 


——————e— SS 
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dm/dx. Tf we increase « when dm/dz is positive and decreas 


dm/dz is negative, x would converge to Xo. : 
on this principle, and their differences are in the way dm/d. 


vity of merit m to parameter © is defined as 
e a when 


All the methods 1 to 3 operate 
xz is measured. 


In Fig. 10-11a, the sensiti 


Time—- 


(c) 


als in a sensitivity-signal-input optimalisi 
Draper and Y. T.. Li, “Principles of Optima 
to the Internal Combustion Engine,’ Amert 


Society of Mechanical Engineers, New York, 1951.) 


Fria. 10-11. Diagram illustrating the sign: 
controller. (Redrawn from Fig. 11, C. S. 
ing Control Systems and an Application 


Case 1. Optimalizing Systems of Types 1 and 3. Inasensitivity-sign 
input optimalizing system, the parameter z is adjusted at a constant ri 


a, either in one direction or the other: 


Ae +a (104 
dt 


The rate dm/dt is measured. If dm/dt is positive, ZeXO, OF not sufficien: 
negative, x is continually adjusted in the same direction. If dm/dt t 
below some preset limit —b, where b > 0, the direction of adjustment 
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is then reversed, and dm/di becomes positive again. The block diagram 
of the system is illustrated in Fig. 10-12. The system merit is measured 
and differentiated to obtain a signal proportional to dm/d¢. 

When dm/dt is sufficiently negative, it operates a reversing relay to 
reverse the direction of the adjustor drive. The operation of the system 
is illustrated in Fig. 10-11. If it is assumed that the random variance in 
measured m is negligible and the 
initial value of x is a, at this point 
dm/dx is considerably larger than [~~ ™— 
b/a. If the adjustor drive is in the 
direction of increasing 2 initially, 
dm/dt is positive, and it stays in this 
position. If the adjustor drive is in 
the direction of decreasing x initially, 
dm/dt is negative and larger in mag- 
nitude than 6. The reversing relay 
immediately operates, and the ad- 
justor drive is again in the direction 
of increasing xz. As 2x increases, 
dm/dx decreases until x2 is reached, 
ut which point dm/dx = —b/a and 
the reversing relay operates. The 
adjustor drive reverses direction, and 
wv is reduced until 2 is reached. 
Afterwards the system hunts between 
v, and x. Figure 10-11a illustrates 
m versus x; Fig. 10-116 illustrates 
im/dx versus x; and Fig. 10-11c, d, 
and e illustrate x,dm/di, and m versus 
lime, respectively. 

As the value of « hunts between 
(he two limits 21 and 2», the averaged tes Wri. Boehe, ingsaee OF “Sanat 

‘ - : ivity-signal-input optimalizing con- 
value of m is lower than its opti- troller. 
mum value. This difference is called 
hunting loss. It can be calculated as follows: For small deviations from 
(he optimum value, m(x) can be expanded into a Taylor series. The first 
(wo nonvanishing terms are 


System merit 
measurements 


m 


Differentiator 


dm 


dt 


Diode rectifier 
Reversing relay 
Adjustment drive 


Main servo 
system 


| | 
Y A 

| 
La eae ge ea cal Se 


” 


a See + (%» — x)? (10-50) 


Ikotween reversals, dx/dt is constant, and the time-averaged value of 
(ao — x)? is 


(@o = 2)? = 14 (to — a1)! (10-51) 


\] 
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Equation (10-50) gives 
m"’ 
Mop — M = 6. (x9 — 21)? = 14 (mope — m1) (10-52) 
where m, is the value of mat x = x1. Another way of writing the hunt- 


ing loss is to express it in terms of b. Differentiating m with respect to @ 
and setting x = 2, 


gio 


= m''(29 — 2X1) (10-53) 
Substituting Eq. (10-53) in (10-52), we obtain 


2 

Mopt — m= — (10-54) 
The hunting loss is proportional to b?. The smaller the margin b we givé 

to dm/dt before reversing, the smaller is the hunting loss. However, if we 

set b at too low a value, the random variation in measured dm/dt alone 

sufficient to cause the reversing relay to operate. Its operation t 

becomes erratic. The choice of b is therefore a matter of comprom 

tolerating either more hunting loss or more erratic operation. 

The choice of adjustor speed a is likewise determined by conflict 
requirements. The random variations in measured m are of relativel) 
high frequency. If not sufficiently filtered before entering into the 
ferentiating operation, such random variations are greatly exaggert 
by differentiation. The filter introduces a time lag which makes 
filtered value of m, and consequently dm/dt, represent past data ra 
than present data. The error so introduced rises sharply with iner 
a and is an effect which has not been considered in deriving Eq. (1 
On the other hand, a system with larger a is better able to cope with 
tracking situation. 

Output-sampling optimalizing controllers derive dm/dt in a diffe 
way. The value of measured m is averaged over one sampling period 
give a sampled value of m. The difference between two such suce 
samples is 7’ dm/dt, where T is the sampling period. Otherwise the 
tem is entirely the same as the sensitive-signal-input optimalizing 
trollers. The process of averaging over one sampling period is, in off 
filtering, and taking the difference between two successive samplot 
equivalent to differentiation. The random variances in sampled 
reduced as T is increased. However, there is a total time lag of 
sampling period in the computed value of dm/dt. The consideratio 
selecting the constants a and b are the same as before. 

In both types of systems, the initial settling period is inversely 
tional to the adjustor speed a. One way of shortening this pe 
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large initial displacements is to make da/dt proportional to dm/dz: 
ag dm 
Pd Ky We (10-55) 
: dm dmdz 
8 Fe ae 
ince di ae ai (10-56) 
dm/dx can be eliminated between Eqs. (10-55) and (10-56): 
ae dm 
i at jae (10-57) 
The sign of dx/dt is always so selected as to make dm/dt positive. Equa- 


tion (10-57) is fairly simple to instrument. 


Test-signal 
generator 


w I cos w,t 


Phase- shift 


Correlation 
network detector iat 
Adjuster System merit 
drive measurement 
Main servo 
system 


Fig. 10-13. Block diagram of continuous-test-signal optimalizing controller. 


Next, we shall study the behavior of the system under the condition of 


negligible random variance and time delay in measured dm/dt.  Dif- 
ferentiating Eq. (10-50), 


dm r 
Gg = 1 (eo — 2) (10-58) 
Substituting Eq. (10-58) in (10-55) gives 
dz r 
Fhe Kym''(xo — 2) (10-59) 


‘The solution of Eq. (10-59) is 


%—-xr¢t= (xo — xi)e—Eim"t (10-60) 


‘The parameter error reduces exponentially with the characteristic time 
constant 1/K ym", 


Case 2. Optimalizing Systems of Type 2. Figure 10-13 illustrates the 
exsential components of a continuous-test-signal optimalizing controller. 


——— 
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A sinusoidal test signal is superimposed on the slowly varying component 


Bq Of 
2=%+/sin ait (10-61) 


where w; is the test-signal frequency in radians per second and J is the 
test-signal amplitude. The amplitude J is small compared with the 
adjustable range of x. Ideally the test-signal frequency @: satisfies two 
conditions: 

1. w, is very low compared with the operating-frequency range of the 
main servo system, so that, in determining the merit m of the main servo 
system, x can be considered as a constant parameter. Consequently m is 
a function of x only. 

2. w: is high compared with the frequency range of xq, so that the test 
signal can be considered as a carrier signal in the auxiliary loop. 

Expanding m(x) about 2a into a Taylor series, 


mx) = m(xa) a m! (Xa) (x a Da) + Lom’ (aa) (a = Gale + ee (10-62) 


Substituting J sin wit for x — ta, we see that the only sinusoidal come 
ponent of frequency 1 in the first three terms is from the second term? 


m'(xa)L sin wit 


The higher-order terms can be neglected because of the smallness of |, 
The signal is applied to the correlation detector, and the output from 
latter is Kam’(xa), where m’(aa) is the derivative dm/dx evaluated 
x = a, and Ka is a known constant. 

As is usually the case, condition 1 is not strictly followed, and there in 
slight time delay in measured m from the instantaneous value of z. a 
effect is to introduce a small phase lag ¢ in Eq. (10-62). The ph 
shift network in Fig. 10-13 compensates this phase lag as well as any ol 
incidental phase shift of the test signal in the system. For instanee, 
the adjustor drive of Fig. 10-13 is a rate servo which varies z at a rate PP 
portionate to its input, the resulting integration process introduces 
phase lag of 90° in the test signal. The phase-shift network retards 
phase of the reference signal by 90° + ¢ to compensate for the comb 


The above discussions show how dm/dz can be measured by introd 
ing a sinusoidal test signal. It is a direct measurement, since the Mm 
ured quantity is independent of the rate of adjustment dx,/dt, 
signal can be utilized to adjust x, as shown in Fig, 10-13, with K\/8 a 
transfer function of the adjustor drive. We have 


dx 


he KiKam (te) = K\Kam'' (xy — a) 
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The second equality sign is obtained from E 1 : 
. (10-58). S$ 
(10-63), the result is q. ( ) olving Eq. 


Lo — La = (Go — ze = am (10-64) 


where 2; 18 the initial value of x. Equation (10-64) represents exponen- 
tial decay with a time constant 1/K,Kam”. 


Under steady-state conditions, #2 = 2,1, and the hunting loss is 


” ” 


= nm —_—_—_— 
Mon — = FG =a = “PP (10-65) 


The hunting loss is proportional to the square of the test-signal amplitude 
I. However, because of random variations in measured m, J cannot be 
made too small. The situation is essentially the same as in cee land 3 
Case 3. Optimalizing Systems of Type 4. From the previous diaoes: 
sions, we see that optimalizing systems of types 1 and 3 require dif- 
ferentiating the measured system merit m, a process which accentuates 
the random variations. The continuous-test-signal optimalizing con- 
trollers are relatively advantangeous since no differentiation is neces- 
sary. The peak-holding system is still better from this standpoint 
because it integrates the decline of m from a measured peak value 
The operation of the system is explained with the help of Fig 10-14 
At the beginning of the operation, the value of xis x; It is then Snevokeedl 
at a constant rate a. The highest measured value of m is recorded in a 
holding system. If m is going up, the new peaks are recorded. If m is 
going down, the recorded peak value does not change; however, the devia- 
tion from the peak value is integrated with respect to time io obtain a 
reverse signal. Whenever a new peak value of m is established, the 
previous reverse signal is erased to minimize the effect of spurious signal 
When the reverse signal reaches a preset limit b, the direction of adicee 
ment of xis reversed. The established peak value as well as the reverse 
signal is simultaneously erased with the reversal of the adjustor drive 
l'hus everything starts over again. 
In Fig. 10-14 the heavy curve shows the expected value of m, but 
because of random variations the thin curve is measured as the vabuk of x 
increases linearly with t. The broken curve shows the recorded peak 
value of m. The area between the broken curve and the thin curve 
represents the reversing signal. Its value as a function of ¢ is plotted in 
Vig. 10-14e. At time ¢t = t, the reversing signal has accumulated to the 
preset limit b, and thereafter x decreases with ¢ at a constant rate. The 
value of « hunts back and forth about its optimum value ao. 
' In case the random variations in measured m are negligible, Eq 
(10-52) expressing the hunting loss as one-third of the hunting sone still 
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ters at specified or extremal values by automatically adjusting the set of 
adjustable parameters. 

To be more specific, the changing situation may represent a change in 
the statistical properties of the inputs, either desired or undesired, or a 
change in the transfer function of the controlled plant. The performance 
parameters to be kept at specified values may be system bandwidth, 
damping factor, or mean-square value of control effort. The performance 
parameters to be kept at extremal values may be engine efficiency, per 
cent yield in a chemical process, the mean-square value of system error, 
etc. 

There are two types of signal flow in such a control system. The main 
servo controller receives an input signal and exercises direct control over 
the plant operation. For any particular setting of the adjustable parame- 
ters in the main servo controller, it operates in exactly the same way as an 
ordinary feedback control system. Simultaneously present is an aux- 
iliary path of signal flow solely for the purpose of making adjustments on 
the adjustable parameters. This measuring and adjusting operation is a 
control system in itself and is referred to as adaptive control loop. 

The adjustments may be based on two types of data: 


1. Measurements on a changing situation 
2. Measurements on performance parameters 


If data of type 1 are used, the adaptive control is an open-loop system. 
If data of type 2 are used, the adaptive control is a closed-loop system, as 
the adjustments change the performance parameters on which the adjust- 
ments are based. Consequently, the latter type has all the inherent 
advantages of a closed-loop system: it is self-correcting and can be 
instrumented with relatively less hardware. 


Fra. 10-14. Diagram showing signals in a peak-holding system. (a) Adjus 
parameter x versus time; (b) expected and measured m versus time; (c) rev 
signal versus time. 


holds. In terms of a and }, it is readily shown that 


m"' 4 The successful operation of a self-optimizing system depends on finding 

ed 6a (wo — 1) a quick and accurate way of detecting any change in the situation or per- 

Therefore formance parameters. In systems where the main concern is to keep the 
att % closed-loop response unchanged while the plant dynamics undergo wide 

Mopt — M = (e ot’) (I variations, a model or inner feedback loop can be used to advantage in 


conjunction with adaptive control of the system’s stability. The meas- 
urement problem is then reduced to a simpler one: that of measuring the 
closed-loop stability at high frequencies where little servo signal is present 
instead of direct measurements at servo frequencies. 

The methods of analysis and design of self-optimizing systems intro- 
duced in the present chapter are limited by one condition: that some 
means of fast and accurate measurement on the system can be found. 
When this condition cannot be safely assumed, the design of the adaptive 
control loop is no longer a simple problem but will be the subject matter of 
the next chapter. 


As before, the selection of b is determined by considerations of hun 
loss and false reversing signal; the selection of a is determined by thet 
delay in measuring m. 

10-8. Summary. The preceding sections give a review of the pring 
and types of self-optimizing systems in existence or having been prop 
Although it is not all-inclusive, the main species are represented, 
way to characterize these systems is as follows: 

1. The system operates under a changing situation. 

2. The system has a set of adjustable parameters. 

3. The ultimate purpose is to keep a set of system-performance 
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CHAPTER 11 


STOCHASTIC PROCESSES AND OPTIMUM DESIGN 
OF ADAPTIVE CONTROL 


11-1. Introduction. The preceding chapter on self-optimizing sys- 
tems is concerned with applications where accurate and fast enough 
measurements of the situation or performance parameters can be made, 
The lack of this condition does not necessarily mean that self-optimiza- 
tion will not work but rather that a more careful process of analysis 
and design is necessary in order to gain most from the self-optimizing 
operation. 

Some of the general problems are as follows: 

1. Measurement. As a stochastic signal of one form or another if 
present, the probable error in measurement is large if a very short interval 
isused. On the other hand, if a long interval is used, the measured result, 
which is a product of a past situation, would not adequately reflect the 
present situation. A compromise in the promptness and accuracy has 
be made. What represents the best compromise? 

2. Adjustment. The adjustment is of future operation while it is ba 
on present and past data only. Furthermore, the data are inaccuri 
because of the compromise mentioned in problem 1. Considering 
these factors, the current data are not necessarily the sole criteria f 
adjustment in an optimum system. What weights should be assigned 
previous data in computing the required adjustments? 

3. Evaluation. Given a statistical description of the changing sil 
tion and the probable error in measurement, how close can one hold 
performance parameters to specified values or to their respective extre 
values? 

Preliminary to a solution of these problems is a criterion for the 
adaptive control system. For extremal secking systems, a natit 
criterion is the difference between the extremal value and the ave 
value of the performance parameter. For systems that regulate 
formance parameter to a specified value, the mean-square value of 
deviation is used. 

In this chapter, the random errors in measuring signal properties 
transfer functions are discussed. While these constitute only 
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limited domains in the entire realm of measurements, they are neverthe- 
less quite representative of the stochastic processes encountered. 

A solution of problems 2 and 3 is given for systems which base their 
adjustment operations on a directly measured performance parameter. 
The situation parameters are not known or measured. The approach 
is as follows: Given the values of the interval and probable error of meas- 
urement and some statistical description of the changing situation, the 
optimum weighting factors on present and past data are determined. 
The deviation from the extremal value or the mean-square deviation 
from the given value is then calculated. It is the lowest obtainable for a 
given method of measurement. To solve problem 1, that of the best 
method of measurement and the optimum T' to use, one has to know the 
dependence of the probable measuring error on 7’, which is different from 
one problem to another. However, when this dependence and the solution 
of 3 are known, the solution of problem 1 becomes a matter of calculus. 

11-2. Assumption of Approximate Stationarity in Identification Prob- 
lems. One common problem in adaptive control systems is to determine 
quickly and accurately the statistical properties of the signals and trans- 
fer functions of the systems in the presence of various disturbances. 
Strictly speaking, transfer functions and spectral densities are not defined 
for systems that are not stationary. However, if the time interval in 
which appreciable change takes place is much larger than the correlation 
times of the signals or response times of the systems, we may define 
spectral densities and plant transfer functions in an approximate sense 
to reflect the immediate situation. Mathematically it is assumed that, 
in the interval t — A < t’ < t+ A, the correlation function ¢10(¢, ¢ + 7) 
satisfies 


gilt, t +7) = on(t’, ¢ +7) (11-1) 


where A is a time interval many times larger than Tm, the largest 7 for 
which $12 is substantially different from zero. Consequently, we may 
use time average instead of ensemble average to determine ¢12(r), and 
the approximate spectral-density function ®1.(s) is then the Laplace 
transform of ¢12(r). Of course ¢12(7) and ®12(s) depend on ¢ in the long 
run. A similar situation holds for time-varying systems if its impulse- 
response function satisfies h(t’, t’ — r) & A(t, t — 7). 

In the following analysis we assume that the averaging interval is short 
enough so that the measured quantity has not changed appreciably 
during the time of measurement, and the error is due only to the finiteness 
of the measuring interval. 

Sometimes a certain amount of prior knowledge about the measured 
quantity is assumed. ‘This knowledge can always be used to advantage 
in reducing either the required time of measurement or the probable error. 
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11-3. Determination of Power Spectrum of Signal i(t). We shall 
analyze two situations: 

1. ®,(jw) is completely unknown. In such a case, the rms value of 
the per-unit error is given by Eq. (7-15): 

1 

SPA 
where 7 is the measuring interval and Af is the frequency resolution. 

2. %;;(jw) is known except to a proportionality or magnitude factor. 


In other words, ®i(jw) = KF(jw), where F(jw) is a known function, 
Since 


€ 


(7-15) 


$i(0) = [ "Tejas he = & [ ”-F(je) de (11-2) 


once ¢i:(0) is known, K is determined. If we assume T' > 7m, the error in 


measuring ¢i:(0) in the finite interval 7 is easily calculated from Eq, 
(C-15): 


(6u(O)noae — (0)}) = Hf Couto) a (11-8) 
2 — (it(O) meas — $:i(0)]?) 
Then 2 = [u(0) 2 
2 is ®;; jw) 2 dw 
m * fo! (jw)] eh a (11-4) 
T| f° eu(jw) de | x 


where BW is the bandwidth of i(t) in cycles per second. 

It is interesting to compare Eq. (7-15) with Eq. (11-4). In order 
obtain an adequate amount of detail in the measured spectrum, Af 
usually less than one-tenth of the signal bandwidth. The ratio BW/ 
represents the advantage of having prior knowledge about the frequen 
dependence of the power spectrum. 

11-4. Determination of the Impulse-response or Transfer Functi 
of a Linear Control Element. Figure 11-1 illustrates a fairly gene 
situation in the experimental de 
mination of the impulse-response fu 
tion (or transfer function) of a lity 
control element or system. The | 
put z(t) is either a test signal or 
signal that can be measured, 
measurable output v(t) is the respo 
to i(t) plus some unknown dist 
ance n(t), where n(t) represents the equivalent output variation due 
disturbances existing in the measured element and random error 
measuring i(t) and v(t). The signals ¢(¢) and n(t) are uncorrelated, 


Fie. 11-1. Block diagram for deter- 
mination of h(r) or H(jw). 
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obvious that there is no way of separating n(é) from v(é) in a direct deter- 
mination of h(r) with the impulse-response method except by repeating 
and averaging, which is not an easy thing to do as far as instrumentation 
is concerned. The other two methods will now be studied: 


1. Correlation measurement with white noise as input, ¢(r) = (7). 
If the measuring interval is unlimited, 


dio(t) =f,” dar — tA) dt! = he) (11-5) 


With the measuring interval limited to T, T > tm, Eq. (C-15) can be used 
to evaluate the mean-square measuring error: 


([iv(T) meas — Pir(7)]?) = 7 | =O Ae ne dio(t + 71) b(t — 71) an| 


Owing to Eq. (11-5) and noting that h(r) = 0 forr < 0, the above equa- 
tion can be rewritten as 


hence = HEP) = SH oe L° AUnGe dT + Mee + rr — 1} dr 


(11-6) 

The interesting aspects of Eq. (11-6) are the following: 

a. There is a mean-square error of the order tm/T' even if ¢nn(0) = 0. 
This is due to the randomness of ¢,:(r). 

b. The mean-square error is proportional to 1/7. 

2. Frequency-response method with a sinusoidal test signal. Let 
I(jw) and V(jw) represent the Fourier transforms of 7(¢) and v(¢) over a 
finite interval 7. The measured H(jw) is obtained as 


H (je) men = TES) = H (Ga) + U2) (11-7) 


The mean-square error is 


(\HG)neor ~ H(Ga) |?) = SRP G ED? (11-8) 


Let P represent the mean-square amplitude of the input signal. Then 
21(—jw)IT(j 
P= i) (jo) 
: 2 ; . 
PinGo) = 7 (N(—jo)N (Jo)) 
Equation (11-8) gives 
(\H Ge) meas — H(ju)|*) = Pee) sche 


The measured /7 (jw) is the vector sum of (jw) and a random vector of 
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mean-square amplitude ®,,(jw)/TP and random phase. Physically, 
Eq. (11-7) can be carried out in two different ways: 

a. A current proportional to v(é) can be applied to an antiresonant cir- 
cuit of very high Q at frequency w. If we assume that the circuit is 
totally discharged at the beginning of 7, the voltage across the anti- 
resonant circuit at the end of T has a vector amplitude V(jw). Its phase 
and amplitude are then compared with the known vector I(jw). 

b. Equation (11-7) can be written as 


In Eq. (11-10), 7(—jw)I(jw) is known. The real and imaginary com- 
ponents of I[(—jw)V(jw) can be obtained by integrating the products 
z(t)v(t) and 7(t)v(£) for an interval 7, where 7(t) is the same as 7(¢) with a 
phase shift of 90° ahead. Since 7(t) is a sinusoidal signal, this is not 
difficult to do. 

3. Measurement of ®;;(jw) and ®;,(jw) simultaneously for an interval 
T. The method of measuring ®;;(jw) and ®;,(jw) is the same as that dis- 
cussed in Sec. 7-3. Let us assume that the variation of H(jw) in passband 
w + 14 Aw is negligible. Then for all w’ inside the passband 


V (je) = 1(ju!)H (ja) + N Gal) (11-11) 
Bij) meer = Tay », 1( =!) Ge") (11-12) 
Bis jt) meae = Prag » I(= je!) V(je") 

“Ty [a v3 I(— je!) I(ja!) + 1(—is!)N Ge) 


(11-18 


There are 7’ Af Fourier components in the frequency range Af or Aw/2w 
and the summation signs in Eqs. (11-12) and (11-13) are meant to su 
over these components. From these equations we obtain 


Pj, (jw) meas 


#17) say ®;;(Jw) meas 


I(—jo"\NGo’) 
oo Ga es eas 
reat ¥ (ie!) 


w” 


The second term represents an error with random phase, ‘To cale 
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its mean-square value we write 
» I(—jo")N (jo’) 
NI y I(—jo"”) IG") [ . 


I(jw 
+) <gttttey a p> dN Gsd9 (11-15) 


By definition of the spectral density, 


(|H (je) meas — H(jw)|*) = 


(LN (joo’) |?) oe 5 Pan( i) = 5 Bax (j) (11-16) 


Substituting Eq. (11-16) in (11-15) gives 


(\H (Je) meas — H(ju)|2) = 5 Pan je) Cig, (11-17) 


There are T Af terms in the summation over w’’. Let Pa,n = 1,2,... . 
T Af, represent 2|J(jw’’)|? at each w’. Each term has a density function 


as given in Kq. (7-13). The assumption 
Pii(ja’’) = By(ju) 


for all w’’ inside the interval w + Aw/2 is equivalent to Z, = E = ®;;(jw) 7. 
Then ; 


EFS A jr eke 
¥ [TGo") |? a ; oa ira: Rid 


(11-18) 


where m is the number of Fourier components in Af. Since Pighs,e Aes 
P, are dummy variables, the integral on the right-hand side of Eq. 
(11-18) is a function of F oily Let it be denoted as J(#). Then 


J(0) =0 

dJ(E) 2 ~ Dare 

“a = at ite te Py Psy gg Pe 
<, Em = 2hm-4 


“i (11-19) 
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Integrating Eq. (11-19) with boundary condition J(0) = 0 gives 
2 


m—l1 


2E-} 


Er 
28;;(jw) 


J(£) = 


d = = 11-20 
Therefore SG Tinie = ee | Tr Af aH 1) ( ) 
Substituting Eq. (11-20) in (11-17) gives 
, ; Prn(jw) 2 
(|H (jo) meas aed H (jw)|*) = (F Af — NPaCjo) (il 21) 


In Eq. (11-21), Af is the frequency resolution in determining H(jw), 
T is the time of measurement, ®;(jw) is the power spectrum of the meas- 
urable input signal, and ®,,(jw) is the power spectrum of the equivalent 
disturbance at the output end due to unknown inputs (such as load dis- 
turbance) and random errors in the measurements. 

The signal i(t) may be a test signal already in existence. In the latter 
case, one must ascertain that i(t) is constantly present and has a fairly 
consistent power spectrum in the frequency range of measurement. ' 

11-5. The Use of Prior Knowledge in the Determination of H (jw). 
The preceding section gives the basic material in the determination of 
transfer functions. To see its application to some specific problems, let 
us consider the following situations: 

1. H(jw) is completely unknown. 

2. H(jw) is known except for two parameters. aA 
The problem is to see how much improvement there is in case 2 over 
case 1 if a test signal is to be used in each case. 

For simplicity, let us assume that H(jw) is negligible for w greater that 
2nf, and that it is required to determine H (jw) with frequency resolution 
Af and a per-unit mean-square error no more than e? within a frequen 
range 0 to fi. ; 

For case 1, let 7: represent the time interval of measurement. qu 
tion (11-21) gives 

wie Pan(Jo) 
We) ™ (Ts Of — YelHGo)P 


The total disturbance caused by the test signal is 


fi 1 fi i 
a ai ep) |? = ooo Pan(jo) af 
dh i $i (jo)|H Ge)? Uf = CAF The I 


0 


“(haf - De 


For case 2, it is only necessary to measure H(jw) at SOME ws, Then 
unknown parameters and [/(jw) at other w are determined to app 
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mately the same degree of accuracy. Equation (11-9) gives 


P Pan (jw) 


2= Tel Gon)? (11-23) 
Since dz? = P2|H(jws)|?, we have from Eqs. (11-22) and (11-23) 
dT; Panfi 
= : 11-24 
d?T., Pnn( Jw) Af ( ) 
The quantity d°7' can be called a figure of demerit. For case 2 the figure 
of demerit is reduced by a factor of Af/f; times ®an(jw2)/Pnn. The latter 


ratio represents the advantage of being able to select a frequency at which 
the noise power is low. 

11-6. Stochastic-adjustment Problems. “Stochastic adjustment” 
means making an adjustment based on data with random error. If 
some factors about the system are unknown but do not vary with time, 
the correct settings of the adjustable parameters do not vary with time. 
Our problem then is to devise a process of adjustment that eventually 
but unerringly leads to the correct settings. Methods of doing this are 
called stochastic-approximation methods. 

Of greater interest to engineers is the stochastic tracking problem, in 
which the system and consequently the correct settings of the adjustable 
parameters vary with time. Because of the random error in measure- 
ment, we can never be completely sure whether an indicated change 
reflects an actual change of the system or is merely an error in measure- 
ment. There is no way of determining the correct settings all the time. 
Our problem then is to find the best process of adjustment. One way to 
define “best”’ is based on the criterion of Sec. 11-1, i.e., least expected 
difference between the extremal value and actually attained value of the 
performance parameter for extremal seeking systems and least-mean- 
square error for systems designed to regulate a performance parameter to 
a specified value. 

In the following analysis the process of measuring and adjusting is 
assumed discrete. Whenever a performance parameter cannot be com- 
puted by continuous analog method, the process has to be discrete. When- 
ever continuous measurement and adjustment are possible, we may simply 
carry the analysis over by making the sampling period approach zero. 

11-7. Stochastic-approximation Methods. The idea of stochastic 
approximation was originated in a paper by Robbins and Munro and 
later generalized to- peak-seeking systems by Keifer and Wolfowitz. 
Before going into the general case, we shall illustrate the underlying idea 
by a simple example; A certain variable x is to be determined by measure- 
ment, and there is a random error 6 in the measured value of x (denoted 
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as y): 
Y= 2 + 6; t= uP 2, ni eaks (11-25) 
The errors 6; are assumed to be independent for different measurements 


i, and 6; has zero mean. 
After n measurements, the best estimate of x is 


i= 


ale ace Oe 11-26 
m=) wertiys ( ) 
t=1 


i=1 


As n approaches infinity, 26; approaches +/n 6 and consequently , — # 


approaches zero. ; a. 
Another way of looking at Eq. (11-26) is to view it as a successive 


adjustment process: 


t=n 


=!) u 
ogee Yi 


i=1 


Ym = tea + gol 


agrees ie i feresdtant (11-27) 


After n measurements, our best estimate of x is 2p. However, at the 
(n + 1)st measurement, yn41 is obtained, and some correction has to be 
made toward yny1. As the value z, has the weight of n measurement# 
behind it, we make the coefficient of adjustment smaller as n becomes 
larger. 

The idea can be carried over to the general case. Instead of Iq. 


(11-25), let. m’ be the measured value of the performance paramete 
m(x): 
A 


m, = m(ai) + 4: ae lee? Mi ae (11-28 


The variable « is to be adjusted toward a certain unknown setting # 


which can be defined in two ways: 
1. Adjusting for a certain specified m,: 


m(ao) = Ms 
m(c) >m, forx > x (11-20) 
m(x) < m, for x < x 


2. Adjusting for some unknown peak value of m: 
m(ao) > m(x) for all « # xo (i 


For case 1 we can make successive adjustments on « according 
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generalized form of Eq. (11-27): 


Lnt1 = Ln + an(m, — m) Mca Ne Deo 5 a oe (11-31) 


Let us examine the conditions on the sequence of numbers a, so that, as 
n— ©, & converges with probability 1 on zo. 

a. a, > 0. This is to assure that the corrections are to be made in 
the right direction, on an average. 

b. dn > 0 as n— infinity. Because of the random variations ‘6; in 
Kq. (11-28), m, — mj, is not zero even if x, = x. For a, to converge on 
any value at all, condition b must be satisfied. 

n=0 n=0 
o ») a,*— const, or, put in a different way, = a,?— 0 as N 


n=1 n= 
approaches infinity. Condition c is to account for the cumulative effect 
of 6;. Equations (11-28) and (11-31) can be combined as 


Lnt1 — Ln = —G,[M(tn) — m,] — And, 


Summing the above equation from n = N upward gives 


te — tv = YS anim, — m(t)] — Yo ands (11-32) 
n=N n=N 


Equation (11-32) expresses the total variation in x from the Nth step 
onward. Since 


(CEawy>- her aus 


ate 

condition c assures that the total random variation > Gn5, approaches 
n=N 

zero as N becomes very large. 


n=0 
d. y, a,— ©. Conditions a, b, and ¢ assure that 2, converges on 
n=1 


some value 2. Condition d assures x, = 2». Since condition d also 


n=0 


implies bY a, — ©, if x, approaches any value other than 2, the total 
n=N 


n= 
corrective effort > a,l[m, — m(an)] in Eq. (11-32) is infinite. On the 
n=N 

other hand, we have no fear of overshoot since each corrective step is 
very small, because of condition 6. 

An example of a sequence satisfying conditions a, b,c,anddisa, = 1/n. 
We note that this is the sequence used in Hq. (11-27). 

The above is intended to explain the significances of the conditions 


————— NT 
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a, b,c, andd ona, Fora rigorous proof of stochastic approximation, 
‘se reader is referred to the paper by Robbins and Munro. — 
Bertram applied the stochastic-approximation idea to con 2) a , 
If the signal part of the input does not change with time, Bae! sage a 
can be achieved eventually by a ae eeaguagtions ie aga all 
ward gain of the control system. ie! ik Lagat 
tem and input-noise statistics. 
For the peak-seeking case, a satis= 
factory procedure has been worked out 
by Keifer and Wolfowitz. The prob- 
lem is illustrated in Fig. 11-2. The 
system merit m is a function of a 
parameter z as given by the solid 
curve. However, for each measure= 
ment, there is a random error 6, and 
z the measured value of system merit, 
=m’, does not equal m. Let our 
Fria. 11-2. Expected and measured present estimate of the optimum sete 


erit (m and y, respectively) , > impea 
Pe iatetle parameter 2. ting wo be denoted by x1. T p 


wee enamel =O St. 
e 


_ ee ee ee 


R 
e 

bal 
c—) 


xy ey x +c, 


our estimate, we set 71 at two different 


values 21 + c; and 21 — ¢: and obtain two readings yi and yj. Our esti 
mate is shifted in the direction of larger y: 


m= +o (yn — HH) (1 

Jor 

The above processes are repeated with different values of a’s and c's: 
tne = to + (Yn — Ya) 7 Wide geen (11 


The problem is: Are there two sequences of constants, On = 
such that in the limit of large n, tn converges to xo? The 7a 2 
been answered in the affirmative under the following rather ge 


assumptions: 
1 The function m(a) hasno more thana finite number of discontinuil 


2. There is only a single peak of m(a). ' ; 
The sequences of constants @n and c, which give this result are specifi 


by the following: 


a, 2 9o tg > 0 oe 
an — 0 Cn 2 0 asn— © ( 
ye <0 a 
Cn 
Ya, = (i 


Lance, < 
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An example of two such sequences is 


1 
a, = - = ar 

An explanation of the above conditions is as follows: The peak seek- 

ing is accomplished by finding the point for which dm/dx = 0. As 
(yn — Y},)/€n is approximately proportional to dm/dz, it is used in Eq. 
(11-34) instead of the measured error, m, — m(a,). To obtain zero error 
as n— 0 the excursions c, for determining dm/dz must approach zero. 
Conditions (11-36a) to (11-36d) are equivalent to the same conditions in 
case | and are for precisely the same reasons. Condition (11-36e) is to 
avoid biasing the measurements appreciably when d?m/dz? are not the 
same as the two sides of the peak. 

We note that in the learning situation the method is satisfactory: 
Not only does x, approach 2, but since c, approaches zero, tn + Cn also 
approach x. After some initial adjustment period, the system operates 
at the optimum point with vanishing hunting. There is still the problem 
of the optimum choice of the sequences a, and ¢, such that the speed of 
convergence to 2p is fastest. Since there is more than one way of satis- 
fying Eqs. (11-36), which is best? 

In the tracking situation, xo is a function of time. On the other hand, 
the sequences specified by Eqs. (11-36) do not allow z, to change with 
any finite rate at all, since a,/c, approaches zero. 

11-8. Basic Assumptions in a Solution of the Stochastic Tracking Prob- 
lem with Specified m.t In the remainder of the present chapter, we shall 
study the problem of stochastic adjustment with changing situation 
parameters. Generally speaking, there are a number of adjustable 
parameters among which a lesser number are independent. To bring 
forth the underlying concepts with least mathematics, only one independ- 
ent adjustable parameter is assumed. 

Let a;, mi, and x; denote the situation parameter, performance parame- 
ler, and the adjustable parameter, respectively, at the 7th measuring 
interval. The measured performance parameter m’, can be written as 


faa . 
mm, = Mi(ar,a2, ano! ip Oea, Ost Ms, 6. 


4 50421522) =f 6; (11-37) 
‘The value of m; depends on the present as well as previous values of a and 
v, and 6; is a random variable representing error in measurement. Let 
m, denote the specified value of m;. A condition representing adequate 
voverage by «x of all possible situations a can be stated as follows: For 
every sequence aj, a, ... , a with nonzero probability, there exists a 


| Sections 11-8 to 11-14, inclusive, are reprinted from a paper by the author, 


Optimization of the Adaptive Function by Z-transform Method, 7'rans, AI EE, pt. U, 
July, LOBO, 


el 
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, Lai Such that 


SEQUENCE Lai, Var, + + - 
Latj-0,Laj) = Ms, (11-38) 


M;(o1,02, ee oF Oj—-1,0j ;Ta1,Vars 2a 


for allj = 1,2,...,7. This condition is necessary if it is at all pos- 
sible to hold m; to the specified value ms and will be assumed valid. 

It will be assumed that the tracking process is fairly good so that 
La, — vi is small enough for the following approximation to be good: 


Xi) ; 
Lai) + > cal (a; — 2aj) (11-39) 


where the partial derivatives am;,/dx; are evaluated at 2 = Lai, j = i 

, i. These derivatives depend on the a’s only. Usually the time 
interval of measurement is larger than the response time of the main 
servo system, and m; depends mostly on 2, slightly on 2-1, and not at all 


on the previous 2’s. 
As the situation parameter a is unknown, the exact values of the partial 


derivatives dm,/dz; are not known. The proposed procedure is based on 
a set of nominal values b, defined by 


om; 
(s i hes (1 


The adjustment procedure is represented by the following equation | 


M:(a1,02, soe (Qi 1M Lia, - + + 9 


= Mi(a1,02, see Oi satsVar, + + + 


_ 


to = & + ¥ wi_;(ms — mM) (11-41 
j=1 
where wn, n= 0, 1, 2,..., are the weighting factors attached 
present and past measurements. The optimum values of wn are 10 
determined. 
The above assumptions can be summarized as: 
1. Condition of adequate coverage: Eq. (11-38). 
2. Linearization: Eqs. (11-39) and (11-40) give 
m', = Ms + bY bi (a; = Laj) + 6j en 
j=1 
3. Adjustment procedure: Eq. (11-41). 
11-9. Z-transform Representation and Optimization. Let the 


ured difference u; be defined as 


u=m, — mi, (I 
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Multiplying Eq. (11-42) by z~‘ and summing over 7, it becomes 


2 bs-j(Laj = xj)z- 


0 7=1 
re) 


U(z) + A(z) = 


v 


Ms M8 


aay -| »> B29 (Laz — 2)279 
j=0 i—j=0 


= B)[Xa(z) — X()] (11-44) 


me U(e), A(z), B(z), Xa(z), X(z) are the corresponding z transforms. 
ultiplying Eq. (11-41) by z-*1 and summing over 2, it becomes 


X(z) = 2-1 X(z) + 2 W(z)U(z) (11-45) 


where W(z) is the z transform of the weighting f i 
Wag aeee ghting factors. Equation (11-45) 


X(2) = ~~ WUE) (11-46) 


gol 


Equations (11-44 - i 
= ene ( ) and (11-45) are represented by the block diagram of 


X,(z) 


Nia, 11-38. Block diagram of the adaptive control loop. (From 8. S. L. Chang, Opti- 


mization of t | ] 
try of the Adaptive Function by Z-transform Method, Trans. AIEE, pt. Il, July, 


In Fig. 11-3, the only unknown function i i 
4 Ae s 
ee ee nis W(z). This function is to 


D = (m, — m)? = min (11-47) 
Let a; represent m, — m;. Equation (11-44) gives 
U(z) = B(z)[X.(z) — X(@)] (11-48) 


thet (11-47) and (11-48) and Fig. 11-3 specify the optimization 
problems completely. Using the method of Sec. 6-8, we obtain 


__@- DK) 
W® = Rant Kel ae 


where K(z) is the closed-loop s i i 
se system funct f 
‘en teins ction of the adaptive control loop. 
—1 
K@ = 1 B(z) Bz et | 


Y@ Vr (11-50) 
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In Eq. (11-50), Y(z) is defined by the following two conditions: 


t, 
Y(2)Y¥(e-}) = B)B(e)®eqza(2) + Pu(2) (11-51) 


2. All the poles and zeros of Y(z) are inside the unit circle. 
The symbol [ |; is defined as follows: Any rational function Q(z) can_ 
be expressed as 
= A; B; Ae 
Q(z) = bY oe + », An, + polynomial in z 
i 


j 


where |a;| < 1 and |b) > 1. Then 


(Q@k = es ary 


a 


From K(z), D can be calculated as 


(11-52) 


Dee) = guy $ BOB] .ee)ll ~ K@ML — KEM 
+ K@OKE eae) 2 C15 


Example 11-1. Sometimes the operating situation changes in sudden steps. 
respondingly, ta is also steplike, and its power spectrum can be approximated as 


200! 
Peesalt) “a 
C(l — e727) (i 


Pz gzq (2) gs (a i erTz)(1 c= e~’Tz) 


The random errors in successive measurements are assumed to be independent, 


53(z) = A (iu 


where A is the mean-square value of 5;. In case the response time of the main 
system is small compared with 7’, the constants b, are approximately 


bo = 6b b; =0 for alli #0 (uh 


The optimum weighting factors wi and minimum value of D are to be deter’ 
Solution. By definition, 
b?Ci(1 — e7?’7) 
= Ss OO om 
Y@Ye") = @ — e'F21)(1 — e-"72) +4 
As M21 — az—)(1 — az), 
@ — e’Fe-!)(1 — e7"72) 


where the constant a is given by 


2 
a+ 1 = 2 cosh v7’ + wre sinh v7 


a<l 
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and the constant M is given as 


M2 = b?Ci(1 — e-2”T) 


(1 — ae~*T)(1 — ae’) (11-59) 
It follows from Eq. (11-57) and condition (2) that 
Y() = M(1 — az) 
A rs a (11-60) 
From Kgs. (11-52), (11-54), (11-59), and (11-60) we obtain 
ae | M(e"T — a) 
Y (2) an or ay = (11-61) 
Equations (11-50), (11-60), and (11-61) give 
K e (e-”? — a)z7 
(6 Sear od (11-62) 


Knowing that K(z), W(z dD i 
Fre ae (2), an are calculated according to Eqs. (11-49) and (11-53), 


— ('? —a)(l - 2 
We) es at ) 


1 
D -(Ger-1)a 


In case of perfect measurement, A = 0. However, as 1/a becomes infinity, the value 
, 


D Ss y 
1 indeter minate. To ev aluate this limit let A be an arbitr ar il small value €, and 
Kq. (11 58) gives 


(11-63) 


(11-64) 


20°Cns h 
- Se vT (11-65) 


aces 
.= 
Substituting Eq. (11-65) into (11-64), we obtain 


Do = lim D = b?C\(1 wb e72T) 


4-0 (11-66) 


Do represents the probable error i i 
: . wer : i 
ntsc or involved in predicting the operating situation of the 


Next the weighting factors w; are calculated. Since 


Wz) = 2 went 
. i=0 
Inq. (11-63) gives 


IE nice 
Mining 2 
Wn = —wole’™ — 1)e~mT On i Ep? 46, ea (11-67) 
TY ‘ili ' ‘ ‘ 
© facilitate further discussion, two numerical examples are calculated. The given 


values and calculated results are listed in Table 11-1, 


———EEOOO 
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Taste 11-1. Two NuMericaL Cases 


SN eS 


Case 1 Case 2 

Given values: 

PP ow Nec see 0.1 0.01 

RES et ERR Lich 0.2b°C, 0.2b°C, 
Calculated values: 

Cicer ase hae 0.380 0.729 

1 Bes ecco ee tee a AE 0.276b2C; 0.071462C, 

Y DT WR ERRNO ton oT 0.182b2C; 0.0198b2C1 


The constant b2C; is the mean-square variation in m without adaptive control. It 
is used here to gauge the mean-square error in measurement as well as the effectiveness 
of the adaptive control loop. The deviation function Do for the ideal situation of 
perfect measurement is also listed for comparison. In case 1, the variations in oper= 
ating situation are relatively fast, and one cannot effectively reduce the measuring” 
error by averaging over a number of past samples and still keep up with the changing 
situation. In case 2, A is ten times larger than Do, and some averaging can be done 
to reduce the random measuring error. 


A surprising fact about Eq. (11-67) is that the weighting factors 0 
past data are negative. However, in close examination of Fig. 11-3 0 
notices that these weighting factors represent only the immediate ¢o 
tribution. The total contribution of U(z) to the value of X(z) is given r 


= a a _o*-¢_ 2a 
V(@) og » sad | Ee closed loop ¥ b Ls az~" 
Therefore, v0 = 0 
eT —a 8 
Vm, = —— _ a} = 2 Oe eee (il 


b 


The factor V, gives the total effect of a measured change of one unit 
the ith interval on the value of x at the (i + n)th interval and will 
called the effective weighting factor. In Fig. 11-4, the immediate (#0 
line) and effective (broken line) weighting factors are plotted for case 

Another interesting feature of the system can be illustrated by 
transient response to a step input in 2a, if it is assumed that A(z) = 0 

i 

Xa(z) = Too 

(eT — a)z“} 


X(z) = K(z2)X.(z) = (1 — az)(1 — 2") 


to = 0 


a 
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The unit step response is plotted in Fig. 11-5. It gradually approaches 
a final value of 0.847 instead of unity. The reason is that, since the next 
value of Ze is uncorrelated to its present value, the sstondtaa change 
in %q is more likely to be in the opposite direction. The adaptive con- 
troller does not go all the way in anticipation of this change. 


Mia. 11-4. The weighting factors w; (solid line) and effective weighting factors 0; (broken 


line) in units of 1/b. (From S. 8S. L. Chang, Optimizati 
Z-transform Method, Trans. AIEE, pt. Il, 7 ‘uly, 1960.) ila i et 


11-10. Reduction to Continuous Case. Reduction to a continuous 
case is possible, whenever the variations in situation parameter a are slow 
enough for the following two effects to be negligible: 


1. The adjustment delay of one measuring interval 


2. The graininess of a finite number of measured values being weighted 
or filtered in arriving at an incremental value of x 


lor instance, case 1 is definitely discrete. A continuous approximation 
would be too erroneous to be useful. Case 2 is at the borderline region. 
An indication is that its first four or five effective weighting factors u; have 
substantial values. , 


If we assume the effects 1 and 2 to be small, Eqs. (11-41) and (11-42) 
become - 


m'(t) =m, + i, b(t — r)[x(r) — aa(r)] dr + 5(2) (11-70) 


dix(t) inn = a 


t 
dl = if w(t = r)[m, — m'(r)| dr (11-71) 
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where in the limit of T> 0,n— ©, andnT’ =r 
bn 
b(r) = b(nT) = (11-72) 


w(e) = a(n) = (11-78) 


Let u(r) = m, — m’'(r). Equations (11-70) and (11-71) can be written 
as 

U(s) = B(s)[Xa(s) — X(s)] — A(s) (11-74) 

“s sX(s) = W(s)U(s) (11-75) 


Equations (11-74) and (11-75) are represented by the block diagram of 
Fig. 11-6. 

The special density ®53(s) can be derived as follows: If each measure- 
ment is assumed to be independent, the correlation function $55(7) is a tri- 


Final value of x(t) 


0 sk a8 37 4T 5T 6T IT. t 


Fic. 11-5. Transient response of the adaptive control loop to unit step at ¢ = 0, 
(From S. 8. L. Chang, Optimization of the Adaptive Function by Z-transform M. 
Trans. AIEE, pt. Il, July, 1960.) 


-A(s) 


i imati i ing situation, 
Fia. 11-6. Continuous approximation of a system with slowly varying si 
S. 8. L. Chang, Optimization of the Adaptive Function by Z-transform Method, 
AIEE, pl. 11, July, 1960.) 
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angular pulse with height A and base 27. Therefore 
$us(8) = [7 du(r)e dr = TA (11-76) 


The method of Chap. 4 can be used for the subsequent optimization 
calculations. 

11-11. Basic Assumptions of Extremal Seeking Systems. The 
parameters a;, mi, x;, m;, 6; are defined as before. The performance 
parameter m; is assumed to depend on a; and 2; only, and 


mi, = mi(ai,xi) + 4; (11-77) 


In contrast to the previous case, m; is to be held at either an unknown 
maximum or minimum, depending on the problem. With no loss of 
generality, it is assumed here that m; is held at a maximum. 

The condition representing adequate coverage is stated as follows: 
For every ai, there exists an Xa; such that 


Mi( ci; 2e;) = Max = Mopt (11-78) 


The optimum value m.» is not known. Equation (11-78) defines 2a,;. 
It is further assumed that the tracking process is fairly good so that 
Ta; — 2 is small enough for the following approximation to be good: 


2m. 
mia) = miata) + 5( GM) (tas 2) (11-79) 


A nominal value —b’ is selected for the second derivative (0°m:/dx;?) a, 
and Kq. (11-79) becomes 


y 
Mi(ai,Li) = Mopt — 4 eae oe) (11-80) 


The desired result is a procedure for adjusting 2; so that 


D = my — Mm = (a; — 2%)? = min (11-81) 


One characteristic aspect of extremal sensing systems is the hunting 
procedure. Even with negligible measuring error 6, one has no way of 
telling whether m; is at its peak without introducing some intentional 
variation in x. The manner in which this variation is made is called the 
hunting procedure. 

With reference to Sec. 10-7, Draper and Li have suggested a number of 
hunting procedures in their work. However, one of their basic assump- 
tions appears to be that the performance parameter can be measured 
instantaneously and that this measured value depends only on the pres- 
ent value of x, In a relatively fast-varying system, if one makes the 
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measuring interval long enough so that the measured m is sufficiently 
independent of a and x outside this interval, the measurements can no 
longer be considered continuous since a is likely to change substantially 
within a few such intervals. 

Two hunting procedures which recognize the discrete nature of the 
problem will be analyzed: 

1. Derivative Sensing. The adjustable parameter x of a typical 
derivative-sensing system is illustrated in Fig. 11-7a. At the end of the 


X, X’ 


Ose, (i-1)T 


(b) 
Le 


(i-1)T | (i+1) 
i (i+2)T 


Fic. 11-7. Best estimate of adjustable parameter setting x; (broken line) and act 
adjustable parameter setting zi (solid line) of the two types of extremal seck 
systems. (a) Derivative sensing; (b) alternative biasing. (From S. S. L. © 
Optimization of the Adaptive Function by Z-transform Method, Trans. Al EE, pt. 
July, 1960.) 


ith interval, x is adjusted to 2; + q and 2 — q in succession for smal 
intervals Tj, and the subsequent 2:41 is determined by m/,; — Mi, ™ 
ured at the two intervals 7'1, as well as all such previous measuremen 
This method is currently in use in the chemical industry to optimt 
yield with satisfactory results. However, for systems with relativ 
fast changing situations, the following proposed method would 
preferable. 

2. Alternative Biasing. ‘The best estimate of the adjustable para 
Za is illustrated by the broken lines in Fig. 11-7). However, inst 
using this estimated value a, the actual x is adjusted to x + q, 
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is illustrated by the solid lines. One obvious advantage of this method is 

that measured data at every interval are utilized on an equal footing. 
eae sesasdant of Derivative Sensing Systems. The changes in x 

and mo»: between the two measuring intervals 7’; are ass ‘gil. 

umed 
From Eqs. (11-77) and (11-80), ahr 
P bY 
My = Mopt — 9 (La, eee, See q? + 6; 


b’ 
Mig = Mopt — 5 (Ha; — Li + Gg)? + by 
The difference of the two equations is 
Mi — Mig = 2b'G(La; — Li) + bi, — 4i, (11-82) 
The correction in 2; is given as the weight i 
i ed sum of 
eae g of all previously meas 


jai 
tin. = a+ 2, wi_;(m!, — m’,) (11-83) 
a 


= (11-82) and (11-83) are identical with Eqs. (11-41) and (11-42) 
Ui mi, io mM, 
5 = by — by 
bo = 2b’¢ b =0 for all i # 0 


Consequently, all the analyti i i 
ytical results of the preceding sect 
The average reduction in m is a sarah 


e FL 5 ae eee na a2? 
D= oT § ae 271) (Xe; a 2)? oT (La pee q)? - (Le; pe a q)’] 


v'T iq? 
T 


iat 

= 9 (ta; — ai)? + (11-84) 
From optimum K(z), the value of (xz; — zi)? can be calculated. The 

result is a decreasing function of g. Equation (11-84) implies that D is at 


& minimum for a certain val i i i 
ieee alue of g which can be determined by setting 


Example 11-2. The power spectra of xa and 6 are given as 

ae 

(L — 2)(1 — 2) Sas 
$55(z) = 2A (11-86) 


Pr ory soz 


Determine D, g, and W(z). 

Pras Equation (11-85) represents independent step-like variations in the operating 
t -s ion, and Bq. (11-86) can be explained as the result of independent measurements 

With mean-square error 4 for each measurement of period 7), 
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Solution. The result of the preceding section gives 


a A 

—_,, A 1 

(La; Lacs te)? = 2072? G = 1) 
Solving g? in terms of a from Eq. (11-88), 

A (1 -—a)? 
°C, a 
From Eqs. (11-89) and (11-90), Eq. (11-84) can be written as 
we. T:A (1 —a)? 

"30: =9), TC 


D 


In Eq. (11-91) only a depends on q. For optimum gq, dD/da is equated to zero, 


the result is 
a? TiA 
(l—a)1—a*) Tb?C? 


=a 


i i ter 7. 
The second equality sign defines the parame : 
For optimum choice of g, Eq. (11-91) can be written as 


Ton 
d=" fo 
ish 2a: 
where jo * 7B 


Equation (11-90) can be rewritten as 


i M .S.L. Chang, Optimiaat 
. 11-8. The functions fp and fy versus ». (From S 8. 
die Function by Z-tranaform Method, Trans, AEE, pt. U1, July, 1960) 


(11-87) 


(11-88) 


(11-89) 


(11-90) 


(11-91) 


(11 
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Fria. 11-9. The parameter a versus 7. (From 8S. S. L. Chang, Optimization of the 
Adaptive Function by Z-transform Method, Trans. AIEE, pt. II, July, 1960.) 


From Eggs. (11-92), (11-94), and (11-95), 7, fo, and f, are calculated. The functions 
fp and f, are plotted against y in Fig. 11-8, and ais plotted against » in Fig. 11-9. In 
a design problem, » is calculated from given data. The minimum value of D, together 
with the values of g and a to be used, is read from these curves. 


11-13. Analysis of Alternative-biasing Systems. For an alternative- 
biasing system, let the actually applied 2’ be 


at = 2% + (—1)ig 
, 


(11-96) 
Then mi. = Mont — Pa — % — (—1)'g]? + 6; 


(11-97) 


The signal (—1)*m} is used for adjustment of 2; and will be denoted by 
Us. 
j=l 
Lig, = T+ ba WijUj 
j=0 


(11-98) 
ui = (—D)im, = b' gta, — 2) + (—1)'8; 
mite en - og - 2 (ta; — n)*| (11-99) 


xcept for the terms in brackets, Eqs. (11-98) and (11-99) are identical 
with Eqs. (11-41) and (11-42) and b = b’g. In the brackets, Mop: is a 
slowly varying term, (b’/2)q? is a constant, (b’/2)(xe; — x)? is small and 
always positive. If these terms can be approximated by a constant C2, 
the additional disturbance is 


a = (-1)C, 
and its z transform is 
TON | ce Cy 
© "Ter 
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Suppose that a filtering factor 1 + es is introduced bes ae aa 
disturbance 4’ is filtered out except for the initial period 7 a . 
the adaptive control loop must be stable, its respigen a oe aa 
ing function of ¢. The disturbance due to 6; piers iae gee : : ce ‘ 
except for the first few sampling periods LRG BOR, after fe _ = 
the adaptive controller. i i cu ar pry — a 

i ing a filtering factor ; 
ee cake re D can be neglected. This point can be verified by 
: ep ateants a Fig. 11-3 gives a fairly good approximation of 
an See aos tiiind system with the additional requirement that 
1 + 2-! is a factor in the numerator of W(z). 

To determine the optimum form of W(z), let 


K(z) = 231 + =)F(@) (11-100) 


where F(z) is an unknown function with all its poles inside the unit circle, | 
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Following the same steps as Example 11-2, the results are 


————— 5-a ; 

Sat s 
(ta; — i) aos Cy (11-108) 
ti Sopeie he = 11-109 
a ee Te a’)(1 — a)? ce) 

b'C,5+8a-—a? vcr 

= re 4 % 

2 40 —a¥) 2 (fp + 34) (11-110) 
/ a / 

Pe Cie = tI: (11-111) 
where fp and f, are the same functions as defined in Eqs. (11-94) and (11-95). Given 


n, the values of fp, f,, and a can be read from Figs. 11-8 and 11-9. 


11-14. Comparison of the Two Types of Extremal Seeking Functions. 
The two types of extremal seeking systems can be compared on the basis 
of equal ®,,.,(s), measuring interval T,,and A. Fora derivative-sensing 


: rie : int outside the unit circle. The system, T is much larger than 7’; and will be assumed at least 47; for 
The point z= —1 4 SERRA 2 BO the present purpose. For an alternative-biasing system, T = T;. For 
method of Sec. 6-8 gives oe: &..2. (2) (11-101) instance, let the following values be assumed: 
xe a ae « 
K() = Yq la +e )¥(e) _ 0.025 
where Y(z) is defined by a Pi 
z) d _ 
¥(@)Y() = Seqeq(@) + Fenas (ig A= 0.190? 


) is th led spectrum of (—1)*6; The calculated results are listed in Table 11-2. 
and ;;(z) is the sample ke 


TaBLE 11-2. Comparison or ExtTREMAL SEEKING Systems 


Ga aa = gif {I — KO — KEM (0 


—_—_—_———————————————— 


K(z)K(z-) dz : et; Sensing Alternative 
ag KOR B53 (z) zZ (IL-l “pny systems biasing 
b” Vy (11-4 RANE | Ries 
«ae em 
Bm aie he 2 c, | 0.20; 0.50; 0.050; 
° in Example 11-2, 7 0.625 0.04 40 
Example 11-3. Assuming the same power spectra as given in % fo | 2.20 1.36 7.05 
i timum W(z), q and D. PLS 0.51 0.17 2.15 
Tpalidien. Since the ‘smocessive samples of 6; are uncorrelated, 3 ar O47 ries 
ia 0.204C, 0.42C, 0.107C, 
$55(z) = B5x(z) = A D | 0.22b'C, | 0.340’C, | 0.182b'C, 


. . A DS ik miso?! ioe 
Equation (11-101) gives (1 — a)e“(1 + 27) 

ae 

(i =a) +2 
W() = opti + (a — a)/2|e") 


b’igaC 
A 


K(z) = Table 11-2 shows clearly the advantage of an alternative-biasing sys- 
fem. Not only is its deviation funetion D lower, the advantage of which 
in clear, but its lower value of test. bias q is also a desirable feature, A 


1 o+ lower value of q means less disturbance to the system, 
where a+ a j 
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11-15. Summary. The problem of measurement and adjustment in 
the presence of an appreciable amount of random disturbance is studied. 

Two types of measurements are of direct concern to control engineers: 
measurement of statistical properties of input signals and measurement of 
transfer functions (or impulse-response functions). For both types the 
integral-square measuring error is approximately inversely proportional to 
the time of measurement. Much less time is needed or much less error is 
obtained if some a priori knowledge of the measured quantity is used. 

A study is then made on the optimum utilization of measured data to 
make adjustments in a self-optimizing system. A basic assumption is 
that there is a value of the adjustable parameter x. which gives the desired 
result in every interval. The desired result is to keep a certain performs 
ance parameter either at a prescribed value or at an unknown extremal 
value. 

In the learning situation, z is unknown but is a constant. The 
method of stochastic approximation can be used to arrive at a in spite 
of the random disturbances and the fact that little is known about t 
adaptive control loop. The method can be described as a process of suce 
cessively reducing the gain of the adaptive control loop. As the true 
error signal is coherent but the error introduced by random disturbances 
is incoherent, it is possible to select a sequence of gain values whieh 
decreases slowly enough so that any coherent error is corrected, but f 
enough so that the cumulative effect of the disturbances is negligible. 

In the tracking situation, the situation parameters vary with ti 
While the value of 2. is not known to the designer, its power spectrum 
assumed to be known. This point is deemed essential. If one does 
know how fast the operating situation varies, there is no possible 0 
mization of the adjustment procedure. Assuming a constant situath 
the more measurements one averages, the less error one gets. Obvio 
this is not the answer. 

Given the power spectrum of za, probable error in each measurem@ 
and the dependence of the performance variable on the adjustable para 
ter z, the minimum value of D and a set of weighting factors w cam 
obtained. The increment in x is computed as a weighted sum of 
present and previous measurements. The set of weighting factors 
gives optimum adjustment for a nominal situation and near-optimitl 
adjustment in general. 

Two types of extremal seeking systems are studied: the derivati 
sensing systems and the alternative-biasing systems. In addition 
determining the minimum D and the weighting factors, a method 
determining the optimum value of test bias is also developed, So f 
performance is concerned, the alternative-biasing systems are found 
more satisfactory than the derivative-sensing systems, 


CHAPTER 12 


COMPUTER OPTIMIZATION OF NONLINEAR SYSTEMS 


12-1. Introduction. There are a number of control problems which 
are far too complex for manual analysis, for instance, the general non 
linear problem introduced in Sec. 9-2. Even with f(t) given, there is as 
— general method for exact integration of the system of Habe d ti 
a x The maximum principle is no more than an aid to finding f(t) 
by oubling the number of unknown dependent variables. A straight 
analytical solution of the optimal-path problem is not gals nonexi ie 
but far from being in sight. : ig 
oon —_ and a number of similar problems, it is possible to utilize 

e hig! -speed large memory capacity of a modern digital computer for 
Bi cats optimal control. There are two stages of operation: 
eal prs Bimbo stage. The optimum trajectories are computed 
re jee ex! escribing the system. These trajectories together 
pies asl _ ing control forces f(t) are then stored in the same or a 


2. The control stage. The st ili i 
another for actual ata of ht ee ee 

In this chapter we shall first give a brief description of the various 
methods of computer control in carrying out the above-mentioned second 
stage. As the required operations such as data transfer storage, readout 
and interpolation are routine computer operations, we shall not mn furth ‘ 
into the subject. Our main concern is the first ee : how to compute re 
optimum trajectories. Two methods can be used, namely: ' : 


i Dynamic programming 
2. Digitized maximum principle 


These methods as well as thei 
eir advantages and limitati i 
developed in the subsequent sections. a a 


12-2. Types of Computer Control. 
act ie control of a system, one must provide means for counteracting 
random disturbances and discrepancies in system parameters, There 
ure at least three ways of accomplishing this. 

800 


In utilizing the stored data for 


eee 
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be correction loop can be linearized in much the same way as Eq. (9-3) 

e correction servo loop is essentially a li i i ni 
Beer y a linear system with time-varying 

* Digital Feedback System. The block diagram of a digital feedback 
se cons is shown in Fig. 12-34. A control command for a terminal point 
se ects not a single optimum trajectory but a whole group of optimum 
Tajectories terminating at b. These trajectories form a horn-shaped 
region in phase space, as shown in Fig. 12-3b. 


1. Input Control. A block diagram of an input control system is 
shown in Fig. 12-1. The control command is usually a set of data which 
completely specifies the optimum trajectory, ¢.g., the terminal point of 
the trajectory (since the starting point is already known). From these 
data, the computer selects the optimum trajectory and feeds the instan- 
taneous values of the state variables X(é) to the closed-loop control sys-= 
tem. With a high loop gain, the actual x(t) is kept very close to £(). In 
computing the optimum trajectory, the assumed limiting values of f(¢) 
are less than the actual limiting values, and there are some reserved 


For every measured 


Disturbances 


Digital computer Controlled igita 
Controller system Control ons 
Control, eed command r | Controlled 
comman 5 
optimum pee! of baad 
trajectory opegiun 
trajectory x 


from c, x (t) 


Fig. 12-1. Block diagram of an input control system. 


Disturbances 


(a) 


| 
| Controlled 


Digital computer 
e system 


%2 


Control 


Selection 
command 


of 
optimum 
trajectory 


Fic. 12-2. Block diagram of a programming control system with perturbation 
rection. 


3a 
capacities in the control forces to overcome load disturbances and 
discrepancies between assumed and actual system parameters. 

Sometimes the controlled system is a moving vehicle and x(t) is de 
mined from ground stations. Then e(é) is sent to the vehicle instead 
x(t). The principle of operation, however, remains the same. 

2. Programming and Perturbation Correction. A system of this type 
illustrated in Fig. 12-2. From the control command data the comput 
selects the optimum trajectory and feeds a signal proportional to t 
to a servo which duplicates the control forces faithfully. Owing bo 
presence of disturbances and the fact that the system parameters 
not be the same as the ones assumed in the computational stage, *() 
not the same as ¢(t). The error is applied to a correction servo W 
generates a set of correction forces a(t), which may or may not bot 
fications on f(t). If the disturbances and discrepancy in a@ are 


x3 (0) 


Fia. 12-3. Block diagram of a digital feedback system. 


x(é), the optimum f(f) is in i i j 
tories. This ere Son eae! staiaiay aay ora 
ot ype paket can be made by computing the system parame- 
mp: i y wed large number of successively measured x(t)’s. 
“! Pos ciable change in a is indicated, the horn-shaped region of 
ig. 1 is modified to suit the new system parameters, thus i 
porating the adaptive concept. sap 
he above describes three basie types of computer control. Types 1 
and 2 are easier to instrument, while type 3 is closer to being a truly 
optimum system. In a complex control system consisting of a number 
of loosely coupled degrees of freedom, the most suitable type for each 
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degree of freedom may not be the same. A combination of the three 
basic types is then desirable. : 

"12-8. T ypes of Control Problems. The controlled system is described 
by a system of equations [the same as Eq. (9-1) except that n, m are 
replaced by m, m’ to save the symbol n for later assignment}: 


é; = Fi(x,f) 


where 21, 2, . . - » tmare the state variables and fy, fo, . - - » fm, are be 
control forces. In a general problem, f1, fz, . - - ; Sm, are also — 
choice variables and are restricted by a system of constraints of the follow- 
ing form: 


ene oa (12-1) 


TAO) ne oe oy SZ (12-2) 


Equation (12-2) can be generalized by allowing each fi, to have more than 
one allowed range. The optimization problem is to select f(t) within the 
allowed range so that one or the other of the following conditions i# 


satisfied. ee 
1. Minimal Time. The system is to travel from an initial point X = @ 


to a terminal point x = b in minimum time. - 
2. Maximum Range. Ina given interval 7, f(¢) is selected to maximi 
the distance along c: 


c+ (x(7) — x(0)] = max (12-8 


where c is an arbitrary constant vector. 
3. Minimal Cost between Terminal Points 


fe U(x,f) dt = min 


T is unspecified but the initial and final points are specified: 


x(0) =a x(T) =b 


A given condition on U is 
Ui > 0 


4. Minimal Cost in Fixed Interval. The interval T is given but xf 
is unknown and Eq. (12-4) is to be satisfied. 

We shall show that the above problems are not independent, 
u(t) be defined as 


u(t) = U(x,f) di 


Then au = U(x,f) 


For the problem of minimal cost between terminal points we 
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u as the independent variable and rewrite Eq. (12-1): 

da; _ %; _ Fi(xf) _ 5. 

du ~ % ~ Uf = PA) sem) 


The problem reduces to a minimal-time problem, with Eq. (12-9) as the 

new system equations. For the problem of minimal cost in the fixed 

interval T, we may define n41(t), Fin+i(x,f), and an (m + 1)-dimensional 
vector ¢ as 

Cm+i(t) = u(t) 

Pni(%f) = U(x,f) 


(12-10) 
2s ee ee 


c =0 Cn41 = =f 


Thus Eq. (12-8) is combined into Eq. (12-1) with m changed to m + ile 
and Eq. (12-4) becomes Eq. (12-3). 

Using the above reduction, we have only two basic problems left: 
minimal time and maximum range. 

12-4. Digitized System Equations. In order to use a computer, it is 
necessary to approximate the system equations (12-1) by a set of differ- 
ence equations. Let the interval from 0 to T be divided into N subinter- 
vals T’, T = NT’. Let x(n) and f(n) be defined as follows: 


x(n): value of x(t) at ¢ = nT” 
f(n): average value of f(t) in the interval nT’ < t < (n + 1)T" 


With sufficiently large N, Eq. (12-1) can be closely approximated by 


x(n + 1) = a(n) + T’PF[x(n),f(n)] ite WM Dera. cs 


= Vy 
ALON De 


.,N—-1 (12-11) 


The system of constraints becomes 


\fi(n)| <1 (12-12) 


12-5. Basic Principle of Dynamic Programming.| Dynamic program- 
ming as developed by Richard Bellman is a very simple and powerful 
concept. It may be billed as the general technique of optimization using 
a digital computer. The amount of literature on the subject is quite 
voluminous despite its relatively short tenure in human knowledge. 
Because of space limitation, we shall develop here only a very basic ver- 
sion of dynamic programming. 

Let us consider a deterministic process in which the state of the system 
is specified by a set of m state variables x1, x2, ... , tm. AS before, 
we shall denote these variables by the vector x. Let 7, (_) denote a set 
of transformations which transform one vector x into another. The 

| Sections 12-5 and 12-6 are based on R. Bellman, “Dynamic Programming,” 


chap. 3, Princeton University Press, Princeton, N.J., 1957, with some of the present 
author's own interpretation, 
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identity of the new vector depends, of course, on both the original vector 
x and the transformation selected as specified by the value of y. An 
example of such a transformation is Eq. (12-11). By selecting f(m) which 
specifies the selected transformation, the vector x(n) is transformed into 
a new vector x(n + 1). 
An N-stage decision process is defined by the selected sequence of NV 

transformations: 

X= Ly, (<) 

Xe Ty, (x1) 

xn = Tyy.(n-1) 


(12-13) 


These transformations are selected to maximize a given return function of 
the state variables, R(x), in the final state. In other words, R(xy) is 
to be maximum. 

Let Qy(Xo) be the set of all points that can be reached from xo by N 
transformations. Let My(xo) denote the maximum value of R(x) for 
all x in Qy(xo). Then, by definition, 


My(Xo) = ro R(xy) 


(12-14) 


where Y* denotes the sequence of transformations yo, Y1, - - + » YNob 
The right-hand side of Eq. (12-14) means the maximum value of R(xw) 
by an optimum choice of yo, yi, - - - , yv—1 Which, of course, is a choice 
leading to maximum R(x) in Qy(xo). 

We note that the optimum xy may not be unique, and for any given 
xy the choice of YY may not be unique. But My(Xo) is always unique, 
since in the closed region Qy(ao) there must be an upper limit to R(x), a 
this upper limit is My(xo). 

The selection of a sequence of transformations is called a poliey 
A policy that leads to maximum return, or R(xy), is called an opti 
policy. Suppose that Eqs. (12-13) represent an optimal policy. iro 
any point x,,n < N, we assert that it is not possible to arrive at a hig 
R(xy) in N — nsteps (or transformations). If this were not true and 
higher R(x) could be obtained by the selection y/, Via >» or 
then the policy yo, yi, - - - > Yn—1Yhy Yours «+ + » Ya would lead to 
higher return from Xo, in contradiction to our assumption that Yo, 

. . , yn-1is an optimal policy. 

The principle of optimality can be stated as follows: The part of 
N-step optimal policy leading from any intermediate stale X» to the fi 
state xy is an (N — n)-step optimal policy from Xn. 

In contrast to this principle, any part of an optimal policy which @ 
before reaching the final state xy is not necessarily an optimal po 
To see this, let us consider a simple example as shown in lig, 124. 1 
transformation 7 consists of 7',, which adds 1 to 2, and 7_, whieh 
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tracts 1 from x. Consider a three-stage decision process starting from 
x = 0. The optimal policy is obviously +++. Starting from « = 1, 
+-+ is a two-stage optimal policy. Starting from « = 0, ++ is not a 
two-stage optimal policy. 

From the principle of optimality, we can derive a recurrent relation of 
the maximum return function My(x). In choosing the transform T,(_), 
we know that the sequence of choices following y is also optimum by 
itself. For any y we select, the ultimate return function is 


My-{T,(x)] 


Therefore, y is selected to maximize the above function. We have finally 
My(x) = max My_,[T,(x)] 1 ee a (12-15) 

y 
M(x) = max R[T,(x)] (12-16) 

y 


Equation (12-15) cannot be solved all by itself. We do not know what 
My-:(%’) is for every x’ within reach of one step (or transformation) from 
x. However, the class of functions My(x) for all x and N ean be solved 
in a rather straightforward manner. 
For each x we can find the maxi- 
mum value of R(x’) among all x’ in 
Q4(x) [Q1(x) is the set of all points x’ 
that can be reached by one step 
from x]. This is M,(x). Then for 
each x we can find the maximum 
value of M,(x’) among all x’ in 0;(x). 
This is M2(x), ete. Thus, in order 
to find a solution for My(xo), we 
find first a solution for all M,,(x) 
with n < N. As the latter func- 
tions can be computed asa class, My(xo) isalso determined. This method 
of enlarging an unsolvable problem into a solvable class of problems so that 
a solution can be obtained is called the technique of invariant imbedding. 

The problem of minimal return can be treated, of course, in exactly 
the same way, with the symbol max changed to the symbol min. 

12-6. Application of Dynamic Programming to the Problem of Mini- 
mum Cost in Fixed Interval. The equations describing the dynamical 


system are Eqs. (12-11) and (12-12). The N-stage return function to be 
minimized is 


R(x) 


Fia. 12-4, A simple decision process. 


n=N-1 


7 > U[x(n),f(n)] (12-17) 


In order to apply dynamic programming, we have to make (12-17) a 
function of the state variables, This can be done by introducing a new 
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state variable 2m+.1: 

Ime + 1) = m4i(n) + T’U[x(n) £(n)] 
To minimize (12-17) is the same as to minimize Im+i(N). 
function RF is then 


(12-18) 
The return 


EX Gt) = Im+1 (12-19) 


In Eq. (12-19) and throughout Sec. 12-7, x is used to denote the m-dimen- 
sional vector a1, t2, . . - , Um, not the (m + 1)-dimensional vector. 

The minimum N-stage return My(x,@m41) is the initial value of Salim 
plus the minimum value of (12-17) which is a function of the initial x and 
N only. Therefore 


My(X,€m41) = Um4i + By (x) (12-20 


From Eqs. (12-11) and (12-18) we obtain an explicit form for the tra 


form T: 


T¢(X,Xm+1) = {x + T’F(x,f), XLm+1 + ee U(x,f)] (12-21 


Substituting Eqs. (12-20) and (12-21) in (12-15) gives 
Imai + By(x) = min {amy1 + T’U(x,f) + Byalx + T’P(,f)]} (12-22 
fl 


Since &m41 is independent of f, a recurrence relation of By(z) is obtained 


By(x) = min {7’U(x,f) + By-alx + T’F(x,f)]} 
for ‘N = 2; 3; 45 Ve 
B,(x) = min [T” U(x,f)] 


Equations (12-23) and (12-24) can be solved by the technique of inv: 
ant imbedding in much the same way as Kgs. (12-15) and (121 
B,(x) is simply the minimum value of 7” U (x,f) for all allowable i 
By-_1(x) is known for all x, By(x) is computed as the mini sum 
all allowable f) of T’U(x,f) and By_.(x’), where x’ = x + 7 F(x,f). 

In order to compute By(x), we need only the data on By-1(x). I 
fore, once By(x) is computed, both By-1(x) and the optimum f associ 
with each x can be transferred to a magnetic tape. The required { 
memory of the computer is not much more than what is necessary rf 
sufficient coverage of By(x) for one value of N only. The optimum f 
obviously different for each x and N, and we shall denote it as £(x, 

Suppose that we are interested in finding out the optimum procem 
well as the minimum return for an Nj-stage process starting from 
The minimum return is immediately read from the tape, as it is wi 
By,(X0). We also read from the tape f(x0,N ie This is £(0) in the 
tion of Eq. (12-11). For the next point, x(1) is determined by 


x(1) = Xo + T’F[X0,f(0)] 
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Repeated use of 

f(n) = £f[x(n), Ni — 7] (12-25) 
and Eq. (12-11) gives the complete optimal trajectory. Of course, the 
past history of x can also be recorded for each x and N to save calculation. 

One serious drawback of the technique of invariant imbedding is the 
memory required with a multidimensional vector x. Of course, one 
cannot record My(x) or By(x) for every x even for a one-dimensional 
problem. However, as most problems have limited range (or region) 
of x, it is generally possible to place discrete values of x close enough for 
one- or two-dimensional problems so that interpolation for the in-between 
values is accurate enough. For higher dimensions, the required memory 
quickly rises to astronomical proportions, and a direct application of 
dynamic programming becomes impractical. 

There has been some recent work on how to overcome the difficulty of 
multidimensionality. However, as the mathematics is quite involved, 
it will not be treated here. 

12-7. Digitized Maximum Principle.t One way to avoid the expo- 
nential rise in required memory capacity with the dimensionality of x 
is to compute along one single trajectory at a time, rather than one stage 
in the entire phase space at a time. Pontryagin’s maximum principle 
gives promise of the possibility. However, an arbitrary conversion from 
continuous to discrete form introduces errors, the cumulative effects of 
which in hundreds of steps are difficult to determine. In the present 
section, a maximum principle is derived directly for discrete-time (or 
sampled-data) systems of the type described by Kas. (12-11) and (12-12). 
or computation of continuous systems, the only error introduced is in 
approximating Eq. (12-1) by Eq. (12-11). Once this is done, the remain- 
ing computation process is exact except for the computer round-off 
error. 

In the following development, the assumptions are: 

1. The dynamical system is described by Eqs. (12-11) and (12-12) 
in a region R (2 may or may not be bounded in all directions). 

2. The function F,(x,f) is single-valued and has bounded first and 
second partial derivatives of x and f for all x in R, all f satisfying the 
inequality (12-12) and all7,i=1,2,... ,m. 


Let T be a trajectory defined by 
ia)=fn) n=0,1,2....,N—-1 (12-26) 
where f(n) satisfies inequality (12-12). With given x(0), the solution of 
Iq. (12-11) is unique and is denoted by x(n) = RN) Hr 1D oe AUN, 


{A more complete treatment can be found in a paper by the author, Computer 
Optimization of Nonlinear Control Systems by Means of Digitized Maximum Prin- 
viple, presented at IRE International Convention, New York, Mareh, 1961, 
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Let I’ be an adjacent trajectory defined by 


f’(n) =f(n) + a(n) n=0,1,2,...,N—-—1 
where \f’(n)| <1 |6f(n)| <e 


The solution is denoted as X(n) + 6x(n): 
£(n + 1) + bai(n + 1) = €:(n) + da;(n) 


+ T’'F {x(n) + dx(n), f(n) + f(n)] 7 = 1,2, . 
Since &:(n + 1) = €(n) + T' F {&(n),F(n)] 
the difference of the above two equations is 
j=m 
6a,(n + 1) = da,(n) + T’ » (oF). 6x;(n) 
j=l hes 
+7 y (2) afi(n) + O12) (12 


In the above equation, O(e2) means a quantity of the order of ¢?, and t 
subscript n denotes that the partial derivatives are evaluated for K( 
and f(n). For any given T, these partial derivatives are functions of 
only. The boundary condition is 6x(0) = 0. 

It is convenient to express Eq. (12-29) in matrix form: 6x and éf ¢ 
be written as column vectors. Let P(n) and B(n) represent m X m 
m X m' matrices with 6, + 7’(dF;/dz;)n, and T’(dF;/df;)n as their res ) 
tive elements of the ith row and jth column, where 6,; is the Kroni¢ 
symbol 6, = 1, and 6, = Oforz #7. Equation (12-29) can be written 


6x(n + 1) = P(n) 6x(n) + B(n) df(n) + O(e) (12 
Equation (12-30) can be solved step by step: 
6x(1) = B(O) 6£(0) + O(e) 


6x(2) = P(1) 6x(1) + B(1) 6f(1) + O(e’) 
= P(1)B(0) 6f(0) + BQ) 6f(1) + O(e) 


There is no need to write P(1)O(e2) + O(e?) since it is still a quantity. 
the order of «2. Repeated application of the above process gives 


(1264 


n=N-1 
éx(N) = ) A(n)B(n) df(n) + OC) 
n=0 
where A(N — 1) =1 


A(n) = P(N — 1)P(N —2)+-+P(n+1) n=0,1,... 


From Eq. (12-34) we can derive a recurrent relation for A(n): 


A(n — 1) = A(n)P(n) mae, Qiaiven sae 
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Now let us derive a necessary condition for f(n) to give an optimal Yr. 
1. Maximal-range Problems. If T is optimal, then from Eq. (12-3) we 
have 
c - [&(N) — x(0)] > c+ (X(N) + dx(N) — x(0)] 
The above inequality may be written as 
c- 6x(N) <0 (12-36) 


2. Minimal-time Problems. Let Qn(xo) represent the set of all possible 
terminal points of the system at the end of N steps starting from xp. 
Since the allowable range of f is continuous, Qy(xo) forms a continuous 
region in the space 21, x2, . .., %m. We assume that the number of 
steps is numerous enough so that Nm’ > m and Qy(Xo) is an m-dimen- 
sional region. For any terminal point b, an N can be found such that 
b belongs to Qy(xo) but does not belong to any Qy(x0), N’ < N. If 
b is an interior point of Qy(xo), there are, in general, infinitely many 
optimal trajectories. This point becomes obvious if we recall the geo- 
metrical concept of Sec. 9-2 and draw an analogy between the discrete 
and continuous cases. In the continuous case, terminal points of optimal 
trajectories of duration 7 are always on the surface of Q(t < T). In the 
discrete approximation, any terminal point b which has a minimal time 
from Xo between (NV — 1)T’ and NT” will have to be reached at the end of 
NT", and generally these points lie inside Qy(xo). Since we are using 
more time than necessary in arriving at b, there are usually infinitely 
many ways of doing this. 

As we are looking for the truly optimal trajectory, what we want is not a 
trajectory which terminates at b but passes beyond b and terminates at a 
point on the surface of Qv(xo). Therefore, the following assumption may 
be made without loss of generality: The terminal point of the optimal 
trajectory ts on the surface of Qy (Xo). 

Let T denote an optimal trajectory with its terminating point X(N) 
lying on the surface of Qy(xo). We shall show that there exists a vector 
§ such that 


é- 6x(N) < O(e) 


Inequality (12-37) is easy to prove if we make the assumption that a 
normal exists to the surface of Qy(Xo) at (N). Since *(N) + 6x(N) isa 
point belonging to Qy(x0), we may choose & as a normal pointing outward 
from X(N), and inequality (12-37) follows immediately from the definition 
of a normal. However, for all we know, the surface of Qy(xo) may be 
like sandpaper rather than like that of an apple. Assuming the existence 
of a normal may be assuming a little too much. We shall give instead a 
longer proof without this assumption. 

To prove (12-37), we shall need the concept of a convex region, which 


(12-37) 
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may be defined as follows: A region Q is called convex af at every boundar 
point p; there exists a vector € such that 


&-Ve <0 


for every vector Viz joining p, to another point pz in Q. 

The concept of convexity can be illustrated by the two-dimensio 
regions of Fig. 12-5: Q, is convex, but 2 and Q; are not since — does n 
exist for p; and ps. A well-known theorem about convex regions is: 

A necessary and sufficient condition for a region Q to be convex is that ev 
point on the straight line joining any two points in Q is also a point in Q. 

For instance, part of the straight line pips lies outside Q, and t 
straight line p7ps lies outside of Q. 


Ag, 
| 
lp, 
P\>%<~ 
4 P. 
5 
5 
— — 
Af 
2, Q, 


Fiq. 12-5. Convex (Q:) and nonconvex (Q and Qs) regions. 


Let Qy(I',e) denote the set of all terminal points x(N) of all adja 
trajectories defined by Eqs. (12-27) and (12-28). Because Qn (Tye) ia 
subset of Qy(xo), and (NV) is on the surface of Qy (xo), (NV) is on the 
face of Qy(T’,e) also. 

If the small term O(c) is neglected, any point 6x(N) on a straight 
joining 6x(N)' and 6x(N)’’ can be expressed as 

n=N 


-1 
6x(N) = cy 8x(N)! + (1 — ex) dx(N)” = A(n)B(n)lex af(my 
n=0 
+ (1 = es) fw 


with 0 <c, <1. Since both df(n)’ and éf(n)” satisfy Eqs. (12-27) 
(12-28), ¢1 6f(n)’ + (1 — ¢:) df(n)” also satisfies these equations, 
&(N) + 6x(N) is a point in Qy(T,e). Therefore, the region Qy (lye) 
convex in this approximation. As x(n) is a boundary point, there @ 
a & such that 

E+ [&(N) + ox(N) — X(N)] = &- dx(N) <0 


However, as this approximation is good only to the first order of 
have inequality (12-37) instead. 


COMPUTER OPTIMIZATION OF NONLINEAR SYSTEMS 321 


The weaker condition (12-37) includes (12-36) as a special case. In 
matrix notation, we can treat — as a column vector and denote the trans- 
posed & (or the corresponding row vector) as &. Inequality (12-37) can 
be written as 


& 6x < O(e’) (12-38) 
From Eqs. (12-32) and (12-38) we have 
n=N-1 
YF A(n)B(n) af(n) < O(e) (12-39) 


Let the row vector ’A(n) be denoted as 2’(n). It satisfies 


RN TN) a (12-40) 
(mn — 1) = X’(n)P(n) nm=1,2,...,N—1 (12-41) 


Inequality (12-39) can be written as 
n=N-1 
\’(n)B(n) df(n) < O(e?) 
n=0 
As éf(n) for each n is independent of all others, we have finally 
WN (n)B(n) df(n) < O(e?) (12-42) 


for all allowed 6f(n) [satisfying (12-27) and (12-28)]. 


Inequality (12-42) is equivalent to maximizing . di(n)F [X(n) ,£(n)] 
t=1 


by the choice of f(n), with 4(n) and &(n) considered fixed. It does not 
rule out the possibility, however, that /F is stationary at the optimum 
value of f(n). The function 2’F is called the Hamiltonian. The digi- 
tized maximum principle can be stated as follows: Along an optimal 
trajectory, the Hamiltonian W' (n)F|&(n),£(n)] is etther maximum or stationary 
with respect to infinitesimal allowed variations of £(n). 

Being maximum is not necessarily being stationary in the present case 
because of the constraints (12-12) on f(n). 

12-8. Computation Procedure Using the Digitized Maximum Principle. 
It is convenient to start the computation procedure from the terminal 
point and work backward. For the minimal-time problem £ depends on 
the initial point x» but there is no known relationship between £ and Xo. 
llowever, as the problem is usually, with the terminal point b known 
what are the best f(é)’s for different initial points xo, the drawback is not 4 
serious one, and the mapping between & and xo is automatically carried out 
in the computing process. Tor the maximum-range problem & = ¢ but 
b is unknown for any given xo and the same situation holds. 

For each trajectory specified by a terminal point x(N) = b and a 
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vector £ = 4(N — 1), a step-by-step computation can be made. The 
order of computation is as follows: 


1. f£(n), x(n) 

2. P(n) 

3. A(n — 1) 

4. f(n — 1), x(n — 1), ete. 


To see how this works, let us assume that f is a one-dimensional f. When 
x(N) and 2(N — 1) are known, the steps are as follows: 

1. Determine f(N — 1) and x(N — 1) simultaneously by the following 
substeps: 

a. Assume an f(N — 1), say 1. 

b. Caleulate x(N — 1) from Eq. (12-11) by a looping procedure (su 
cessive approximation). With asmall 7’ the result converges very fast. 


i=m 


c. Calculate H = oY \(N — 1)FiAx(N — 1), £(N — 1). 


d. Assume a slightly different f(N — 1), say 0.99, and recalculate I 
We note that, in substep d, x(N — 1), is not recalculated. 

If the recalculated Hy_, in d is smaller, then the value of f(N — 1) 
assumed at the beginning is correct, and we go on to the second step. 
not, we repeat the above substeps: 

a. Assume f(N — 1) = —1. 

b. Calculate x(N — 1). 

c. Calculate H from substeps a and b. 

d. Assume f(N — 1) = —0.99 and calculate H from b and d. 

If H as calculated from d is smaller, we proceed to the second step Wi 
f(N — 1) = —1, and calculate x(N — 1). If it is larger, we assume 
new value of f(N — 1), say 0, and repeat the above calculations. 
situation is illustrated in Fig. 12-6. The dot represents the assu 
value of f, and the arrow indicates the sense of dH/éf. In Fig. 1 
we have (1,+) at the first trial, and it is satisfactory as f maximizes 
In Fig. 12-6b we have (—1,—) at the second trial, and it is satisfae 
for the same reason. In Fig. 12-6c we have (1,—) (—1,+), (0,-++) for 
first three trials. Trials 1 and 3 indicate that H has a peak somew 
between 0 and 1, and so we try 144. Subsequently we get ('9,—), 
(14,0). At f = 44, H is stationary and we proceed to the next step} 

2. Calculate fori = 1,2,...,mandg =1,2,,..) 


P(N —1)=14+T7" (=) 
OX; N-1 


uv ( OF; : " 
P(N —1) =T (F) fort # j 


The partial derivatives are evaluated for x(N — 1) and f(N = 
calculated from step 1, 
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8. Calculate fork = 1,2, ...,m. 
j=m 
M(N — 2) = Y ACN — 1)Pa(N — 1) 
j=1 


Now we are ready to repeat step 1 again to calculate f(N — 2) and 
x(N — 2), etc. In the second and subsequent rounds we can usually 
save work by assuming at the first trial an f(n) equal to f(n + 1) of the 
preceding round. The optimal f 
usually stays at one extreme value 
for many periods of T’ and then at } 
the opposite extreme for more periods 
of 7’, ete. Under such situations, 
step 1 is concluded at the first trial. 1 Trials 

For f of two or three dimensions, 
we can generalize the method of Fig. ~! 

12-6 by starting our calculations on 

the lattice points of a coarse lattice 
in f space, and then refine our calcu- 
lations on finer and finer lattices until 
the point for maximum # is found. 

It is unusual but not impossible to 
have two or more values of f which 
make H locally maximum or locally 7-1! 
stationary. In such a case, the com- (bd) 
puted optimal trajectory branches off f 
into two or more branches. 

In computing backward, the sig- 
nificance of x(NV — n) is that it takes 
n steps to arrive at the terminal point © 
x(N) from x(V — n) along an opti- 
mal trajectory. The number n is -1 
significant but not NV or N — n. In 
actual computations, it is desirable 
to write x(N) as x(0), x(N — 1) as 
x(1), ete. However, the original numbering sequence is kept throughout 
the present text to avoid confusion. 

For the minimal-time problem, the terminal point x(N) is fixed, and 
we compute a different trajectory by assuming a different & The 
collection of trajectories so computed forms a horn-shaped region, as 
shown in Fig. 12-3b. For the maximal-range problem, —& = c, and we 
compute a different trajectory by assuming a different, x(NV) satisfying 
c+x(N) = const. The result is a region as shown in Fig, 12-7, The 
solid lines show the optimal trajectories to be followed, and the broken 
lines marked N =» mean that, at these points, it takes nm periods to 


(a) 


Trials 


Trials 


(c) 
Fic. 12-6. The process of adjusting f. 
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arrive at a range of [x(N) — x]-c. Sometimes there are two trajectories 
meeting at the same point such as A. The upper trajectory is the opti- 
mal one since it has lower n. 


Fra. 12-7. Optimal trajectories for the maximum-range problem. 


12-9. Summary. The computation of optimal trajectories for general 
nonlinear systems has two alternative objectives: 

1. It serves as a standard of performance against which the per- 
formance of a practical system can be compared. 

2. In very important or very expensive systems, it is worthwhile to 
instrument for the theoretically best by incorporating a digital computer 
in the control system. 

Three ways of realizing the second objective are discussed: 

1. Input control 

2. Programming with perturbation correction 

3. Digital feedback system 


Methods 1 and 2 are easier to instrument. However, they are o 
nearly optimum, since in computing the ‘optimal trajectories” the f 
range of control forces cannot be used. This is for the purpose of rese 
ing some corrective capacity to reduce the errors introduced by load 
turbance and discrepancies between actual system-parameter values 
their design center values. Type 3 is more nearly optimum but is 
more expensive in terms of required computer memory and speed. 

There are two ways of computing optimal trajectories: 

1. Dynamic programming 

2. Digitized maximum principle 


Dynamic programming in its basic form is absolutely reliable. Howe 
the required computer memory capacity increases exponentially with 
number of state variables. Digitized maximum principle is a trajec 
by-trajectory procedure and does not require nearly as much com 
memory, but care must be taken that no branches are neglected in 
computations. As there is no general method of doing this, the br 
ing problem has to be studied for each individual case, 


APPENDIX A 


BASIC STATISTICAL THEORY 


The following is a brief treatment of the elements of statistical theory which are used 
throughout this book. 


A-1. Distribution, Expected Value, Standard Deviation, Moments. A chance 
event has a plural number of possible outcomes: a, b, c, .... Let P. denote the 
probability of a, P, the probability of b, P. the probability of c, . . . ; the set of 


probabilities Pa, Ps, P., . . . is called a distribution. 
If we attach a value V to each outcome and denote these values as Va, Vi, Ve, . . « 
the mean value or the expected value of the event is defined as 


WV) =PWVatPVi+PVet-:: (A-1) 


The bracket { ) denotes the expected value of the expression inside. Let F(V) denote 
an arbitrary function of V; then the expected value of F(V) is 


(F(V)) = PaF (Va) + Po (Vs) + P(Ve) ++: - (A-2) 


Equation (A-1) actually implies Eq. (A-2), since we could have attached F(Va), 
F(Va); FV); « «. <5 Instead of Va, ‘Vi, Ve, .). 2 , to) outcomes a, .B; 6, . +s 
respectively. 

Another point of view is that V is a chance variable, since it may take on any of a 
number of possible values, and it is called a variate for short. The standard deviation 
o is defined as the root-mean-square value of V — (V). Mathematically, 


AP AUTRE) (A-3) 


A more complete set of characteristic values of a distribution are the moments, 
which are defined as the expected values of the powers of its variate. By definition, 
the nth moment is (V”). The nth moment about any point a is ((V —a)"). The 
expected or mean value of a distribution is simply its first moment. The mean- 
square deviation, or o?, is simply the second moment of a distribution about its mean. 

A-2. Continuous Distribution. Sometimes, the possible value of a variate is con- 
tinuous, for instance, the weight of a human being, the peak voltage of a lightning 
discharge, or the distance traveled by a thrown ball. The probability of V being 
within a range of values V; + dV/2 is proportional to dV, if dV is small enough, and 
can be written P(V:) dV. The function P(V) is referred to as the density function 
of V. Sometimes it is also called the probability density. 

The mean value of V and of the function F(V) becomes 


? 


(v) = [7 vemav (A-4) 
(F(V)) = im F(V)P(V) av (A-5) 
Obviously, we P(V) dV = 1 (A-6) 


bub 
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The above integration from — © to « does not lose any generality. If the range of 
possible V is limited to Vi < V < V2, it is only necessary to require that P(V) = 0 
for V < Vi, and for V > V2. 

Sometimes V is discrete in certain ranges and continuous in others. The distribu- 
tion is then represented by finite probabilities at the discrete points and probability- 
density functions in the continuous ranges. Equations (A-1) to (A-5) still hold, 
with the understanding that summation is used in the discrete ranges and integration 
is used in the continuous ranges. 

A more sophisticated approach is to express P(V) in terms of the 6 functions (or 
impulse function, Appendix B) in the discrete ranges. Equations (A-4) to (A-6) 
would then hold good in general. 

A-3. Compound Events. A compound event is one which can be separated into a 
number of constituent events. Let A, B, C, .. . denote the constituent events, 
and ai, @2, a3, . . . 3 b1, be, bs, . . . 3 C1, C2, Ca, . . . 3 . . . denote the possible out- 
comes of A, B, C, ... , respectively. Each combination of aj, bj, cx, . . . is then 
a possible outcome of the compound event. The joint probability P(ai,b;,cz, . . .) is 
the probability that the combination a;, b;, cz, . . . occurs. 

Two events are said to be independent if the distribution of one is independent of 
the outcome of the other, and vice versa. Therefore, if all the constituent events are. 
independent of each other, the joint probability P(ai,b;,ck, . . .) is simply the proba« 
bility that ai, bj, cx, . . . have occurred in turn without consideration of what haa 
happened before: 

P(Gi,b;,¢k, - 


where P(a;), P(b;), P(cx), . . . are the individual probabilities of occurrence. 
Let Vai, Voj, Vek, . . . denote the values attached to the outcomes aj, bj, ch, . . « y 
respectively. From Eqs. (A-7) and (A-2), it can be readily shown that 


(Fi(Va)F2(Vo)Fs(Ve) + + -) = (Fi(Va))P2( Vo) (F's(Ve)) 


where F,, F2, F'; are arbitrary functions of Va, Vs, V., respectively. 

A-4. The Law of Large Numbers. Let us consider the sum V; of the variates of 
chance events. For convenience, we shall change our notation somewhat and use 
subscript 7 to denote the ith chance event, and V; its variate: 


. .) = P(a:)P(b;)P(cx) . . (A-7) 


i=1 


The distributions of the events may or may not be the same, and some of the distri 
tions may be continuous while others may be discrete. Since Eqs. (A-1), (A-4), 
(A-9) are all linear, it follows that 


N 
v= ) W (Acl 
t=1 


Equation (A-10) states that the mean value of the sum is the sum of the mean, | 
the same statement hold for the standard deviation? 


N N 
= pte ))?) = = i P = i- (Vi 
oan wo =<LY w-wn]> Y= om 


i=1 


N N 
+ ¥ Y= was - wo | 


iwl jel 
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The symbol 2’ indicates the sum over all values of j not equal toi. If the constituent 
events are independent, Eq. (A-8) gives, for i ¥ j, 


(Vs — (Vi)) (V5 — (V3))) = (Vi — (Vd) (Vs — (V5))) = 0 
Therefore, Eq. (A-11) becomes 


(A-12) 


(A-13) 


Equation (A-13) holds only if all the events are independent of each other. 


Next, let us consider the special case of N independent trials of identical chance 
events: 


(Vi) = (Vi) o; =o, with =1,2,...,N 


Let Vay denote the average value of the N repetitions. By definition, 


a i (A-14) 
Equations (A-10) and (A-13) become 
(Vav) = (Vi) (A-15) 
o;? a1 
(Var — (Var))8) = $5 = 5 Not = (A-16) 


Equation (A-15) reaffirms the definition of expected value. Equation (A-16) is very 
significant. It states that, by averaging the results of NV independent trials, the mean- 
square deviation is reduced by a factor 1/N. In other words, the averaged outcome of 
a large number of trials is far more predictable than the outcome of a single trial. 
This is essentially the law of large numbers. 

In the next two sections, two theorems will be discussed which put the above idea 
into more tangible form. 

A-5. Tchebysheff’s Inequality. While one would expect a narrow distribution 
when the standard deviation o is small, and a widespread distribution when o is large, 
the idea is given a more precise formulation by Tchebysheft’s inequality. 

Let us consider a distribution P(v) with expected value m and standard deviation o. 
Our purpose is to determine the probability that v lies outside the range m+a. For 
simplicity’s sake, the distribution P(v) is assumed to be continuous, although the 


essence of the arguments applies whether the distribution is continuous or not. By 
definition of « and Eq. (A-5), 


ot = (ev =m) = ["* @ — m)PO) dv + for (v — m)*P(w) do 


44 > ‘@ompPads 


m+a Sasa 


If we drop out the middle integral on the right-hand side of Kq. (A-17) and replace 
(v — m)* by a* in the other two integrals, the value of the expression is reduced by both 


actions. Therefore 
>a? Ihc P(v) dv + diss P(v) dv | 


The expression inside the brackets is the probability that » differs from m by a distance 


(A-18) 
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of more than a and will be denoted as P(\v — m| > a). Equation (A-18) becomes 


2 
P(v — m| >a) <% (A-19) 
The inequality (A-19) holds, no matter what the nature of the chance event. Suppose 
that » is the averaged outcome of N independent and identical trials and that the 
standard deviation of each trial is 01; Eq. (A-16) gives o? = o12/N. Equation (A-19) 
becomes 
a2 


Na? 


Its significance can be illustrated 


P(\v — m| >a) < (A-20) 
This relation is known as Tchebysheff’s inequality. 
by the following example: 

Example A-1. How many throws of dice do we need to average so that for at least 
90 per cent of the time the averaged outcome is between 3 and 4? 


Solution 
m = 3.5 
a=0.5 
o12 = 4 X (0.5? + 1.52 + 2.57) K 2 = 2.917 
P(\j»y — m| >a) =1-0.9 =0.1 
2.917 
* ae ued 1 


A-6. Normal Distribution, Central Limit Theorem. The normal distribution for 
variate » with expected value m and standard deviation o is defined as 


1 
Vino 
It is easily verified that, for the above density function, Eq. (A-6) and the definitio 
of m and o are satisfied. 

The significance of the normal distribution lies in its universality which is stated 
the central limit theorem: 

The distribution function of the averaged value of N independent trials of a 
event approaches normal distribution as N approaches infinity. 

The most significant part of the above statement is that it says nothing about 
distribution of the chance event itself. Thus whether each trial consists of throwi 
a die or flipping a coin or measured instantaneous velocity in a controlled turbul 
flow makes no difference. 

The above statement of the theorem may be generalized as follows: 

The distribution function of the sum (or averaged value) of N trials approaches 
normal distribution as N approaches infinity; whether or not these trials are indep 
and whether or not these are repetitions of the same or a large number of different 
events make no difference, as long as the contribution of each trial, including its aff 
on the outcomes of other trials, is of the order of 1/+/N of the sum (or averaged value), 

Let us consider the case of flipping a coin with equally probable outcomes 1 and = 
If each throw is independent, the distribution of the averaged outcome approne 
normal distribution with zero mean and 1/N aso? for large N (Eqs. (A-15) and (A-10) 
If each throw is partially dependent, for instance, the probability of repeating 
previous sign may be 34 instead of 14, the contribution of each trial is 


Pv) = 


e7 (v—m) 2/202 


(A-21) 


AUtwWtM thts dad 
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The above expression can be explained as follows: A + gives 1 for the value of its own 
trial, }¢ for the expected value of the next trial, and so forth, and the sum is divided 
by N to give its effect on the averaged outcome. Since the latter is of the magnitude 
1/\/N, the generalized version of the theorem states that the distribution of the 
averaged outcome approaches normal distribution. However its mean-square devia- 
tion is larger than 1/N. 

If a variate meets the conditions for normal distribution as stated in the generalized 
version of the theorem, its distribution function is given by Eq. (A-21), with 


m = (v) 
a? = ((v — m)?) = (v2) — (y)? 


Its distribution and higher moments are completely determined by the first and second 
moments. 

The fact that the central limit theorem is pertinent to our study of systems with 
random inputs can be explained in the following light: The value of a system variable 
at any given time is determined as the net effect of inputs of all sources at all previous 
times. If, at any fixed instant, many of these previous inputs are significant and the 
contribution of none predominates, the amplitude distribution of the system variable 
is expected to approach normal distribution. Once its mean-square value is known 
the entire distribution is determined. ‘ 

A-7. Characteristic Function and Moment-generating Function. In the preceding 
section, the central limit theorem is stated without a proof. To prove the theorem 
we shall need some powerful mathematical tools, which will be introduced presently. 
Consider a joint-probability-density function p(x1,r2, . . . aids 
ating function is defined as 


MB 8a) 05. . j8a) fff toe f reeses Suess sO) OXD (y =) dz (A-22) 
ze i=1 


Its moment-gener- 


where dz denotes the product dx; drz + - + dam. If we change s; to —s; and 2; to ¢; 
we recognize immediately the right-hand side of Eq. (A-22) as the multidimensional, 
double-ended Laplace transform of p(x1,22, . . . ,%m). Another way of interpreting 
Kiq. (A-22) is that the moment-generating function is the average value of the expo- 
wm 
nential function exp ( > siti). 
t=1 


If we differentiate the moment-generating function with respect tO 8a, 8, 8, . . 


in succession, where the subscripts a, 6, c, etc., may or may not repeat themselves, 
and then set s; = 0 for all 7, we find 


aa - - - 8M(0,0,0) , 
Sy “Oy a ae If SEAS f (eersae * + +)p(t1,22, . . . 42m) dr 
= (Latnte > - +) (A-23) 


1 F 
The above equation shows that the statistical moments are the corresponding deriva- 
tives of the moment-generating function at the origin. 


Next let us consider the convergence range of the integral on the right-hand side of 
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Eq. (A-23). For purely imaginary values of si, 7 = 1, 2, ...,™, 


MC8s,88y eto) tS fff toe J pees Jar ceg tae GED. 3 siti) | dz 
Hf i=l 
EP a a [ peeves eB he tm) Oe 


Therefore the function M(si,s2, . . . ,8m) is convergent in a region adjacent to the 
imaginary axes of s;. This is the same as the region of convergence for double-ended 
Laplace transforms. Similarly to the Fourier transform, we define a characteristic 
function as 


C(w1,@2, - ere 


i=m 
Jon) = M(jonjion . . - sjem) =< exp () joes) > (APA) 
t=1 


Sometimes C(wi,w2, . . . ,#m) is obtained first, and the density function is calculated 
from it by the inverse transform: 


(1,02, . » » tm) = One ff te if C(wi,w2, . . . 5m) exp (- ea) dw 


(A-25) 


A-8. Mathematical Derivation of the Multivariate Normal Distribution. Presently 
we shall give a heuristic proof of the central limit theorem for a number of correlated 
random variables x1, 22, %3, . - - , tm. The result is more general than that stated in 
Sec. A-6 as the latter is simply a special case with m = 1. Each of the m variables in 
obtained as a sum of N independent trials: 


k=N 
et y Yik 
k=1 


(A-26) 


Bre Bs cal ae ye 


The joint distribution of yiz, yor, . . + » Yme May be different for different values 
k. For agiven k, the distribution of some yi may be independent of others and so 
yz may be zero. Without losing generality, we assume that the mean value of © 
yix is zero. We assume further that there exist limits a > 0 and b > 0 such that 

lyceyik > + > Yul <a” (A-2 
——— 
n factors 
k=N 
> oh > No 
k=1 


The above assumptions are broad enough to suit many situations. For instance, 
the joint distributions of the yix’s of each trial are identical, or if there are a fow 
ferent distributions of yi’s which are repeated proportionately in the N trials ae. 
varies, the assumptions are met. We shall derive a joint distribution function for 
zi’s as N becomes very large. 

As the mean-square values of the 2;’s increase proportionately with N, we d 
new set of variables u; = xi/\/N. The moment-generating function of uw; in gi 
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= < + vA met DD ay vam +0 (— 1) 


il=m 


“ [1 a oy dd sisi(yixyix) + O (= Hh 


k=1 


In the above equation, the second equality sign is due to the independence of 
the NV trials, the fourth equality sign is due to the assumption that (y:,) = 0, and 
rexel /N VN) stands for terms of the order of magnitude of 1/N +/N and Sitio 
Taking the logarithm of the above equation and using the expansion log (1 + B 
=1+2 + (a?/2) +--+ + give 


t=m 
log M(si,82, cee 18m) = log Cexp ( 7: sus) > 
i=1 


k=N ijl=m 
= ph [ sw i SiSr(Yixytx) + O € ai (A-29) 


We note that 
k=N 
1 1 
065-9) = (a) =° 
bi N VN Vn) 


It approaches zero as N approaches infinity. Let the coefficients Cx be defined as 
(uu). From Eq. (A-26), 


1 k=N k’=N 
Ci = << Ox ue) o. mw) > 
s Cysxyuer) 
ved 


Since the different trials are independent, (yixyx’) = 0 fork # k’. The above equa- 
tion becomes 


hy, 
ax 
N 

k, 


Cu = (un) = WV , (Yinytr) (A-30) 


332 SYNTHESIS OF OPTIMUM CONTROL SYSTEMS 


Substituting Eq. (A-30) in (A-29), we obtain in the limit of very large N 


il=m 


te a y 14 Cusisi 


il=1 


log M(si,s2, . . 


The above equation will hold to any desired degree of accuracy by increasing N. 
Alternatively, it can be written as 


2 


l=m 
M(su82, » » « 58) = exp ('Y’ 14Cusisr) (A-31) 
iJ=1 


4 


Correspondingly the characteristic function can be obtained from (A-31) by sub- 
stituting jw; for s;, and the joint-probability-density function is given by Eq. (A-25): 


2 il=m 
Pp (U1, U2, . 2s lm) = ox fff Lil fen(- >» LV Cyrwjor = ¥ jens) dw 
=e il=1 a 
(A-32) 


To evaluate Eq. (A-32), it is convenient to use the matrix notation. Let C denote 
the m X m matrix, with Cy as its element in the ith row and /th column. C is & 
symmetrical positive definite matrix, by definition of Cu. Let ’() and u’(u) 
denote the row (column) vectors with elements w; and wi, respectively. Since C is 
symmetrical, there exists an orthogonal transformation T such that 


TCT =B 


where B is a diagonal matrix. Let B;, 7 = 1, 2, .. . , m, denote the diagonal ele- 
ments of B. Let vectors u and v with elements yu; and »; be defined as 


ge To 
v=T"u 
Then vw =o (T-!) =o0'T 
v=uT 


=m 
3 Ciwiw, = o'Co = o'TT-!CTT“'o 


= u/Bu = > Bip? 
a 


i=m i=m 


wu; = 0/0 = o/TTu = wv = BS Mids 


i=1 i=1 


dw = det |T| du = du 
Substituting the above relations in Eq. (A-32) gives 


i=m 
D(Uita «ss site) = Il xf. exp (Gers — juivs) dug 
i=1 


By a change of variable z; = wu; + jvi/Bi, we obtain 


1 «© 7 i2/2B; © + jvi /By 
Aa Pee eee Pe Re sree 
ae |. exp 6 Baus® = juirs) dus = | pee 


env aay 


en Bata ds 


1 
V 20 Bi 
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Substituting the above expression in Eq. (A-33) gives 


t=m 
= . vi? /2B;i 


e v 


=1 
(2n)mi2 (II Bi) 


P(uUi,U2, . . . Um) = (A-34) 


The expression on the right-hand side is in terms of the new variables v;. To change it 
back to uw: we note that 


i=m 
[] B: = det |B] = det |T- det |C| det |T| 
t=1 


= det |C| 
wm 
iF vB lv = w/Clu 
By 
i=1 
aj=m 
ase 
ait Agjuiu; 
4j=1 


where A is the determinant of C and Aj; is the cofactor of C,;;. (There is no need to 
distinguish Ai; and Aj; since C issymmetrical.) Substituting these relations in (A-34) 
gives 

ij=m 


Pf ‘| 
Un) = Gry" Far exp ( a >: Assn; ) (A-35) 
ees 


19> 


P(u1,U2, eae 


Equation (A-35) gives the density function of what is known as the multivariate 
normal distribution. In case there is only one variable, it reduces to Eq. (A-21). 
In practical work, the density function of Eq. (A-35) and the moment-generating func- 
tion of Eq. (A-31) are just about equally important. 

A-9. Population, Sample, Statistical Inference, Estimate, Maximum-likelihood 
Criterion. In the previous sections, we have been concerned mainly with a deductive 
process: Knowing the physical nature of the chance event or events, what can we say 
about the outcome? However, in many applications, the reverse process is called for: 
Knowing the results of a large number of trials, what can we say about the physical 
nature? 

While the deductive process is safe, the inductive process is highly tricky and unsafe. 
Usually, physical insight plays the main part and statistical techniques play only an 
auxiliary role. Whether the result is worthwhile or not depends on whether the 
physical insight is right, and there is no textbook which teaches us how to gain physical 
insights. We shall confine ourselves to an introductory treatment of the statistical 
techniques. 

Population means the results of a large number of trials of the same experiment. 
Here the word “same” is the most significant. It may mean one of the following: 

Case 1. We isolate all the factors we know which may influence the experiment, 
control these factors so that they are identical in all the trials, and leave only the 
unknown factors to chance, 
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Case 2. We know certain factors that would influence the results of the experiment. 
However, we do not isolate these factors, as we are interested in the statistics of the 
population as a whole. 

Sample means the small percentage of the population that we analyze. 

Inductive inference can be defined as the process of discovering laws or other facts 
about the population with an analysis of the sample as a starting point. In case the 
above-mentioned facts are numerical figures, the process is called estimation. 

There are two points worth mentioning: 

1. The reliability of inductive inference or estimation depends on the selection of 
the sample. If the population is defined according to case 1, the samples are selected 
so that in each sample, all the controlled conditions are met. If the population is 
defined according to case 2, the sample must have as nearly as possible proportional 
representation of all factors appearing in the population. 

As an example of the former, in studying the safety of an airplane in a storm or the 
seaworthiness of a ship, we are interested only in the statistical properties of atmos- 
phere turbulence or ocean waves under stormy conditions. All the samples represent- 
ing calm conditions or insufficiently stormy conditions are discarded. As an example 
of case 2, in forecasting election returns, the samples must be selected in such a way 
that they represent every possible classification of the populace. 

2. In many cases, the word ‘“‘population”’ represents a concept rather than results of 
actual measurement. For instance, in determining the spectral density of missile 
acceleration in rough air, every sample record is obtained at tremendous cost. In 
analyzing the samples, our purpose, in the most restrictive sense, is to find out some 
thing about the spectral densities of the acceleration of missiles of the same design in 
atmosphere of the same description of roughness. It is a conceptual population rather 
than an actual population which consists of no more than the sample records that we 
analyzed. If it were for the latter, we might as well put the sample records on & 
standard signal analyzer and get the exact height of each spectral line. No statistics 
would then be necessary. 

The following example illustrates the process of inductive inference: Knowing the 
exact weight x, 7 = 1, 2,3, . . . , N, of N individuals of a certain sex and age group, 
we wish to determine the following for the entire population of the same sex and age 
description: 


1. Mean weight 
2. Standard deviation 
3. Distribution of weight 


The first two items are point estimates. Let brackets preceded by “‘est”” denote estiv 
mated value: 


N 
1 
est [Z] = — Xi (AW) 
wd, 
N 1 N y 
1 1 
est [(a — #)2] = = (a; — est [Z])? = — za? —(— xy (Ae?) 
wd ae (v2 


The distribution function can be determined on the basis of maximum likelih 
that is, finding the distribution function among all possible distribution fune 
which gives the highest probability or probability density of what already ocett 
Physical insight comes into the picture in restricting or specifying possible distribu 
functions. To illustrate what we mean, we shall study the following three caseat 

Case 1, Since the weight of an individual can be considered as independent of 
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weight of another individual selected at random, Eq. (A-7) gives the probability 
density of all the z;’s occurring as 


[] ?@ = max (A-38) 
The only restriction on P(z) is ; 
| ie P(x) dz =1 (A-39) 
The solution of the above variational problem is readily shown to be 
1 
P@) = N 6(x — 2) (A-40) 
a 


Equation (A-40) states that the distribution function is a sum of impulse functions at 
the measured weights. This is obvious nonsense physically, but it is the correct 
mathematical solution. It leads to the following observation: 

Case 2. Since the weight of an individual fluctuates easily within 5 lb, we arrange 
the weight scale in 5-lb divisions. Conceivably, a man may weigh anywhere from 
50 to 400 Ib, and there are 70 such divisions. Let P(2q) represent the unknown proba- 
bility of a randomly selected individual’s weight being in the range zq + 2.5, and n, 
represent the number of samples x; in the range tq + 2.5. The probabilities P(e, ) 
can be determined by the maximum-likelihood criterion: y 


]] ta)Ira = max (A-41) 
> P@a) =1 (A-42) 

The solution is ‘ 
P(ta) = 54 (A-43) 


which is, of course, a well-known result. 

Case 3. A more sophisticated form of physical reasoning could be as follows: A 
man’s weight can be considered as the net result of a large variety of factors none of 
which has a deciding influence. It is expected to approximate the normal distribution, 


1 
P(x) = re en (a—m)2/262 (A-44) 
Therefore, the joint probability density is 
N 
if N 
P (xj) = ( — ) e~D (@i—m)2/ 202 a 
I V/ Ino , (A-45) 


Differentiating the above expression with respect to m and o and setting the results 
equal to zero, the following results are obtained: 


N 
5. 
t=1 


N 
ey >. (e, — m)* 
fel 


These are exactly the same as Nga. (Ae86) and (A-37), 


m= 


= ll 
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What are the comparative advantages of the three cases? The first case has no 
merit. The second case is the most widely used. It does not require much physical 
insight and is more or less safe. However, a large number of samples are required. 
The third case requires far fewer samples; however, if we extend the result to extremes, — 
we get absurd answers. For instance, there is always a nonvanishing probability 
for negative weight. 

The above example has its parallel in spectral-density measurements. For labora- 
tory tests, we collect a large amount of data and impose no more restriction than in 
the second case. However, for on-the-spot tests, such as in self-adaptive systems, the 
shape of the spectral density is usually assumed fixed except for the magnitude and 
maybe one or two other frequency parameters. 

A-10. Confidence Interval. When a certain parameter of the population is esti- 
mated by averaging over N samples, the sample mean does not necessarily coincide 
with the population mean. The estimation has little value if we do not know how 
reliable it is. A confidence interval is defined as a tolerance range about the sample 
mean which is so selected that the probability for the tolerance range to contain the 
population mean is a predetermined value. As discussed in Sec. A-6, the sample mean 
Vay is distributed normally about the population mean é (same as (v)): 

P(Vav) = Ni e-N WV ay—B)2/20 12 (A-46) 

2r 01 

where a; is the standard deviation of each sample. It is either known or can be esti- 

mated by Eq. (A-37). The probability that |Vay — a] is less than a is 

Pil View oe 3| <a) = | P(Vav) dVay = sae (WVNAYalo) eehde 
ba Vr J —(/N72)(a/a) 


In Eq. (A-47), &(x) is the error function. An alternative form of writing (A-47) is 


a= kyo NE 


where kp is a constant depending on the specified probability. It is tabulated aa 
follows: 


(A-48) 


TaBLe A-1 
kp 


P 

6 0.477 

8 0.906 
1.163 

5 1.386 

9 1.82 

99 282 

Equation (A-48) can be applied to the problem of Example A-1. Since a lay 


number of throws are required, Vay is assumed to be normally distributed, 1 
Table A-1, kp is found to be 1.163: 


yo \? 1.163? X 2.917 _ 
w = 2(*t!) ~ ax 0.52 30 
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The result is considerably more optimistic than that obtained from Tchebysheff’s 
inequality. The discrepancy is there because, while Tchebysheff’s inequality repre- 
sents an absolute bound for even the worst possible distribution, Table A-1 holds 
only if Vay is normally distributed. 

A-11. Error Analysis. One problem which arises frequently in system design is 
that of fixing the tolerance limits of components employed in the system. Let b; 
represent performance parameters of the system, let ay represent parameters specify- 
ing properties of the components, and let the prefix 5 represent a small change; then 


6b; = y — Say, 


Oop 
BL 


(A-49) 


If only a few a,’s are in question, the standard procedure is to consider the worst case: 


|dbs| S y| 


However, if the effect of a large number of independent da’s is being considered, it 
would be desirable to take into account the randomness of the variations, Let oi 
and o, represent the standard deviations of b; and a,, respectively, and oi, the standard 
deviation of b; due to changes in a, alone. From Eq. (A-13), we have 


ab; 

wt = Sgt = S| 
da, 

a & r 


Generally the component parameters are specified by tolerance limits Ky: &® — Ky < 
ay <a + Ky. If P(ay) is assumed to be uniform within the allowable range & + Ky 
and zero outside, o, can be calculated: 


db; 
5a, | ll (A-50) 


2 


ou" (A-51) 


1 &, +K K,2 
a, 2K, Ja ae * (eu — Gu) day, = * (A-52) 
Combining Eqs. (A-51) and (A-52) results in 
1 Ob; |? 
2 = 2 - 
o#=5) | ie | Ke (A-53) 
“ 


A-12. Correlation and Regression. Sometimes two chance events are not inde- 
pendent. If the underlying physical processes are known, this dependence can be 
expressed in terms of the joint distribution function of the two variates. However, 
the reverse is true in most instances, and the dependence of the two variates is estab- 
lished by analysis of the samples. Two related techniques, correlation and regression, 
will now be discussed. 

Let wu and v denote the two related variates, for instance, weight and height of an 
individual, temperature and life of a vacuum tube under rated operating conditions, 
noise amplitudes at one instant of time and at a slightly later time. Each sample is 
expressed as a pair of values of the two variates (ui,v;). A regression line of v on u is 
defined as a straight line on the uv plane with least-mean-square vertical distance 
from the sample points (see Fig, A-1): 


Vv = Cy + byu (A-b4) 
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Regression line 


Fic, A-1. Regression line of v on u. 


The constants c, and b, are determined by minimizing 


N 
DY bi = Ce + dos)? 
t=1 


where N represents the total number of samples. Differentiating the above expression 
with respect to c, and b, and setting the results equal to zero, 
N 
Y bi = + bus) = 0 (A-55) 
i=1 
N 


(A-56) 


D, wiles — + bous)] = 0 
i=1 
Equation (A-55) can be expressed as 
Cy = Vay + Ositlay (A-67) 


where Yay and way are sample average values. Substituting Eq. (A-57) in (A-56) and 


solving for b,, 

3 Ui(v; — Vav) 
———* (A-b8) 
> Ui(Ui — Uav) 


v 


by = 


Let Du; and Dv; represent uj — way and v; — vay, respectively. Since 


Ds (ui = ew) — bs (v; = Dav) =0 


Eq. (A-58) can be written as 
Y, (Dus)(Des)(1/N) Y) Dusy(Dvy)—(/N) J) us(Doi)—A/N) Sve) 


= ase = 
by ae 2 Ou? ou" 


i. > (Dux)? ou 
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Equation (A-59) gives three alternative forms for calculating the slope b, of the regres- 
sion line of v on u. With reference to Eq. (A-57), (A-54) can be rewritten as 


(A-60) 


v— Vay = b.(u 4 as) 


Equation (A-60) shows that the regression line always passes through the center of 
weight (Uay,Yay). 

The physical significance of the regression line of v on wis that, with u considered as 
the independent variable, it represents the best estimate of » as a linear function of u 
in the least-mean-square-error sense. We can also interchange the roles of u and » 
in the above derivation to obtain the regression line of u on v: 


U — tay = buy — dey) (A-61) 


(1/N) = (Dux) (Dv) 


ie (A-62) 


where 


Te" 
Equation (A-61) represents a straight line on the uv plane with least-mean-square 
horizontal distance from the sample points. 

Equations (A-59) and (A-62) give two different values for the two slopes. How- 
ever, neither value is really significant, since both depend on the selection of units of 
Measurement for the two variates. To remedy this defect, w/o, and v/o,y are used as 
the variates instead of u and v, Equations for the two regression lines become 


Vv — Vay ae U — Uav * 
oy = duv oe (A 63) 

UU — Uav c v — Vay f 
PPA £0 sorrel (A-64) 


(1/N) hg (Dux) (Dv;) 
Duv a 


Tuy 


where (A-65) 


The significant point about the two new regression lines is that they now have identical 
slope which is also dimensionless, The slope ¢u» is referred to in the literature as the 
correlation coefficient, as it is a unique measure of the linear dependence of the two 
variates. 


APPENDIX B 


PROPERTIES OF THE 6 FUNCTION 


The term “5 function” is synonymous with “impulse function.” It is defined by 


the following equations: 


d(x) = 0 for all z #0 (B-1) 
x2 Pep! if21 <0 < 22 B-2 
a dg A if zit2 > 0 oe 


The above equations imply that 5(z) has an infinite peak at x = 0. Physically we 
may think of y = 4(x) as a finite pulse with unit area under its curve, and the y dimen= 
sion is stretched to infinity while the x dimension is shrunk to zero in such a way that 
the area measure zy isconserved. Derivatives of 5(x) can be considered as derivatives 
of the pulse function under the same dimensional transform. The situation is illus~ 


trated in Fig. B-1. 


Fra. B-1. Impulse function and its derivative, as « approaches zero. 


Equations (B-1) and (B-2) constitute one of the strictest definitions of 6(x) and al 
derivative operations on 3(x). Sometimes we are not concerned with the derivati 
and Eq. (B-1) can be relaxed somewhat to allow 6(x) to be represented by a wi 
class of analytical functions in their limiting forms. This point will be illustrated 
the end of this Appendix. 

From Eqs. (B-1) and (B-2), the following properties of the 6 function can be re 
shown: 

Li 


ie 5(x) dx = u(t) 
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where a is an arbitrary positive constant and w(t) is the step function. 


2; 
JP terse ar = {9 


where f(x) is an arbitrary function of x and is continuous at x = 0. 
3. Let 5™(x) denote (d"/dx")5(x), and assume that f(x) has continuous derivatives 
up to and including the nth. Then 


f * (@)8(@) de = { (1990) 


Equation (B-5) can be proved as follows: By partial integration, 


rw <0 < x2 
m2. >0 


(B-4) 


a1 <0 <2 


@t2 >0 i 


[2 teow @ az = 1 @pey 7" - [ peosen@ az 


im 
dae 
Owing to Eq. (B-1), 6-(x) = 0 for all c #0. The first term on the right-hand 
side vanishes. Repeating the above operation, Eq. (B-5) is obtained. 

4, 


5(z) = 6(—2) (B-6) 


More precisely, what we mean by the above is that 
zm 0 
[PO ses—2) az = 
v1 


It is readily shown by a change of variable y = —a and then by making use of Eq. 
(B-4). 

Sometimes it is desirable to replace 5(x) by one of its analytical forms to facilitate 
calculation. If such a representation meets both conditions (B-1) and (B-2), with 
well-defined derivatives all the equations (B-3) to (B-6) hold. An example is 


1 <0 < ae 
22 >0 


&(x) = lim —“ e-et#? B-7 

(2) = Jim (B-7) 

50(2) = FO) jim — 22F pes (B-8) 
aL —s) 1. 


However, if the application does not involve derivatives of the 6 function, the require- 
ment of Eq. (B-1) can be relaxed. One frequently used form is 


ma RS Des 
(2) = Jim 5- f°” dt (B-9) 


It is easy to verify that the above expression satisfies Eqs. (B-2), (B-3), and (B-4). 


APPENDIX C 


STATISTICAL ERROR IN MEASURED 
CORRELATION FUNCTIONS 


C-1. Continuous Measurement. With reference to the block diagram of Fig. 7-7, 
yr (t) is the measured correlation function by filtering 2,(¢): 


ar(t) = ii(t — r)é2(t) (C-1) 


yr(t) = i h(t')ar(t — ) dt! (C-2) 
a a(t’) dt’ =1 (C-8) 


(yr(t)) = (ae) = di2(r) (C-4) 
The mean-square error in the measured correlation function is 
eo? = (lyr(t) — bra(r)}2) = (lye) — [dr2(7)P 


Equation (C-5) follows directly from (C-4). Presently we shall evaluate (yr (t)) 
From Eq. (C-2), 


dtr = fy fo” me rme yeaa — than(t — #1) aa 
Since 7(t) is stationary, let t’ — t’’ =». 


(tp(t — t’)ae(t — t!’)) = (er(r)ae(r + )) 
x fj = (i1(O)ia(r)ii(A)ia(r + A)) = R(z,A) (Os 


Equation (C-7) defines the fourth-order moment F(z,)). 


Equation (C-6) can be put into a more convenient form. With the understan 
that h(t) = 0 for all t < 0, Eq. (C-6) can be written as 


(tyr (t)]}?) = RE on At’ )At’) R(t, v— uty dt’ dt” 
es i ss i © MUR — RGA) dt! dd 
. | is ie A(R! — »)R(rA) FAC + NRG, —NI] ard 


Because of the stationarity of i(/), a time translation gives 


(ix (O)ia(r)i(AYEa(e eA) = GAC AYICT = \)HO)ia(r)) 
ade 
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Therefore R(7,A) = R(z7,—2), and Eq. (C-8) becomes 


duel) = f° SO) Rr ar (C-9) 
where fr) = J “(h(t — ») dt + | © h(A( +d) dt 
=2 / "RAG +2) dt = 2 ae h@h(t +d) dt (C-10) 


Combining Eqs. (C-5), (C-7), and (C-9) gives 


ct = [ORD dr — [oal)P (C-11) 
Equation (C-8) gives a general expression for the mean-square error in the measured 
correlation function. 
If z(t) is Gaussian, R(7,A) can be evaluated in terms of the correlation functions. 
Let 1, ©2, x3, and x, represent 7:(0), ¢2(7), 71(A), and i2(A +7), respectively. Applying 
Eqs. (A-23) to (A-31) gives 


R(r,r) = Ci2Cs4 + CisCos + Cul rs 
= [d12(7)]? + dir) b22(A) + dia(t + A)pia(7 — A) (C-12) 


Substituting Eq. (C-12) in Eq. (C-11) gives 
ee? = om fa)[b11(A) b22(A) + b12(t + AVHia(r — d)] ddr (C-18) 


In the special case, integration from 0 to T is used instead of filtering: 


1 
ho = {7 Ot <P 
0 $$ <0 and ea id 


From Eq. (C-10), fa(A) = 2(7 — d)/T?, and Eq. (C-13) becomes 
27 
&s" = Te rk (T — d)[bii(A)h22(A) + bialr + A)Orx(r — AJ dd (C-14) 


Let Ax be a sufficiently large interval such that, for all X > du, the correlation functions 
are negligibly small. If Z >> 1, Eq. (C-14) can be approximated as 


eS z a [611(A) G22(A) + di2(r + A)die(r — d)] dd (C-15) 


The mean-square error is inversely proportional to 7. 


, C-2. Sampled Measurement. Suppose that instead of Eqs. (C-2) and (C-3) we 
ave 


u(t) = Y hnT)e@ — nT) (C-16) 
n=0 
Y hor) = 1 (C-17) 
ned 
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Equations (C-1), (C-4), and (C-5) still hold. | Equations (C-9) and (C-10) are replace 


by * 
(uO!) = Y fi(nT)R(,n7) (C-18 

n=0 
SirlnT) = 2e, y h(mT)h[(n + m)T] (C-19) 

m=0 


where ¢«) = 14, and en = 1 forn #0. Substituting Eq. (C-18) in (C-5) gives 


eg? = : fa(nT)R(r,nT) — [$12(7)]? (C-20, 


For the Gaussian case, Eq. (C-12) still holds with \ = nT’, and Eq. (C-13) is reph 
by 


eg? = > filnT)[ou(nT)boo(nT) + drier + 2T)b12(r — nT)] (C-2 


n=0 


In the special case with 
1 


> <n<WN 
h(nT) = ( O<n 
0 t$<0 t>N 
Equation (C-19) gives 
N-—1n 


Sn(nT) = 2en a 
and 


n=N-1 
ee? = a > (N — nyenlois(nT) poo(nT) + dil + nT )oi2(r — nT)) ( 
n=0 


There are two special cases: 
1. For large N, 


es? 


WD, WulrP gaan?) + bute + nT ouelr — mY) | 


2. For large T such that ¢1:() and ¢22(t) are negligible for all ¢ > 7, the only 
nificant term in Eq. (C-22) is the n = 0 term: 


@gt = 5 (4110) 2210) + [or2l)} 


APPENDIX D 


TABLE OF INTEGRALS 


This Appendix gives a table of integrals of the type 
i 1 je gn(s) 

staat = 7 BONG wry weer, to 
ha(s) = aos” + as" +--+ +a, 
9n(8) = bos?n-2 + b382*—4 +e e+ tba 
and the roots of h,(s) all lie in the LHP. The table was computed by G. R. MacLane, 
following the method developed in Sec. 7.9 of James, Nichols, and Phillips’s book 
“Theory of Servomechanisms.” In their original derivation, the path of integration 
was along the real axis and the condition on h, (s) was that all its roots lie in the upper 
half plane. However, their derivation holds equally well if the path of integration is 
along the imaginary axis and the condition on hn(s) is that all its roots lie in the LHP, 
With the present specification, all the coefficients a and b are real. 

The table lists the integrals J, for values of n from 1 to 7 inclusive: 


where 


bo 
a 2a0a1 
—by + ads 
Ip = a 
2aoa1 


—tabs + ad — Set 
i eee 
2a0(aods — aja2) 


bo(—aias + a2as) — acasb, + agaiby + ads (aods — a,a2) 


2ao(aoas? + a12d4 — 10203) 
"> 2aoAs 


Ms = bo(—aoaaas + ards? + as%as — 20304) + aobi(—azas5 + asa) 


+ aob2(a0a5 — ayas) + aobs(—agas + ayar) 


obs . 
ME as (—aoaias + agas? + ay?ay — ayara3) 
As = do’as? — 2ayaraias — ayarasds + agas?a4 + ata? + 10275 — A1d20304 
Ms 
I, = 
2aAc 


M¢ = bo(—aoasasag + ayasas? — a,2a5? + 2aracasas + a10;04a6 — a\a42a5 — a22as? 


— A2037a5 + G2M304a5) + aob:(—ayasag + G205? + a;2a5 — 30405) 
+ aobs(—ayas? — A305 + ayayasy) + aob3(apasas + @,2a6 — a)a205) 


+ aobs(aoaias — aoas* — a;%ay + ayayas) + ae (ay*ay? + ayayayag 


= Zagayayay = aya yayay + ayaytay — ay*ayay + aytae bh Qyay2ay = ayayayay) 
wah 
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= A053 + 3a G1a3sd6 — 2A A1A4A5? — Aot2dsA5? — Aods°ds + AyA;*Qad5 + a1%a¢" 
— Qayarasag — A12A34a6 + 12ay2a5 + Aid27as” + A10203°A5 — A10 203045 

— M; 

o 2ZaoAz 

= bomo + acbim, + aobeme + + + + + aobeme 

= Ag? 67? — 2AoA1A62A7 — 2A pA2A4A7? + ApA2A 5647 + AphzAsA6” + ApAs7A507 
= aodsds2a¢ + ay2a63 + Baia2G4Ad7 — 2d 1A2G 506? — Aia304A6? — Aid45a7 
+ aiay2asag + a28a7? — 2a22A3h67 — G22A4AsA7 + A27G57d5 + G20304A7 
= 2030405346 + 203706" 

= p47? — ApA5Ae7 — G1A4A6Q7 + A1A5M6? — A2?A;? + 2a2Q3A6Q7 + A204A5Q7 
— G2052A5 — A37Q6? — Ash4?Az + A304A56 , 

= AyA2A7? — AoAs4647 — Apd4AsQ7 + Aods’A5 — A1A2A6A7 + Gidsa6? + 1047a7 


— 41040506 HP 
= —A2A7? + Wad Ged7 + Aodsd4d7 — AyAsA546 — G1?A6? — AiG2d4a7 + te 
2 
= H2A5A7 — ApA1A4A7 — ApAiAsAg — ApA2d3A7 + apds2a6 + ai?asag + A1027a7 
— 21020306 : 
= Ay?A3A7 — Gy2A5? — ApAiA2A7 — ApA1A3A6 + WAidsds + ApA2AsA5 — Aods"A4 
+ a12a2d6 — a12a4? — 102?A5 + 1020304 


teh. (ag2aya7? — 2ao2a3d5Q7 + Ao?as53 — 2apas2acaz + G0A1A2d547 + 3a :A1AsA546 
ay 


— Qayaiayds? + Apd2d3247 — AG203052 — Ayds3ag + aods’asas + ai%d6? 
+ ayarasar — 2aPa2a5a5 — a17A304a6 + G12G42a5 — 1G2?G307 + A1027A5" 
+ A10203245 — 2102030405) j ‘ 
= —ay%a;3 + 309?a agar" + 92020507? + 2ay?a30407? ad 30203050607 a oo ay 
+ ao%as8a5 — 8a0012a62a7 — Badr d2d407? + AyAiA2A A647 + 3a001030506 
= ApGA3Q4Aed7 + Zapaias2asa7 — 2ayaasa52a6 — MoA2*A307? + 2ApA2A3"A6A7 j 
+ AA2G 304057 — Aod2A305°a5 — 0039062 — Acds*a42a7 — Apds*a asa + a\%ag : 
+ 8ar%acasaea7 — 2ar2a2d5A6? — A12A304A62 — Ay7a48a7 + Ay7A47a5A6 + Aid2*ay 
— 2aj22A3A 67 —= 102240507 + 102205245 + G102032A6? + A1d203047a7 
— 10203040506 
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BIBLIOGRAPHY 


In the following chapter-by-chapter description of reference material, the letter and 
number in the parentheses refer to the Bibliography list at the end of the Appendix. 

Chapter 1 is a general introduction. There are some references for the material of 
Sec. 1-4 on the basic relations of linearly compensated systems, namely, Horowitz’s 
work (H3) and G. C. Newton’s discussion of a paper by Lang and Ham (L1). 

References for Chap. 2 are Wiener’s original work (W2), G. C. Newton’s introduc- 
tion of the quadratic constraint to control saturation tendency (N1,N2), Westcott’s 
work using contour integration in the s domain for optimization (W1), and Chang’s 
root-square-locus method (C7). Truxal’s book (T2, chaps. 7 and 8) has both the time- 
domain and s-domain methods. 

References for Chap. 3 are books by Wiener (W2), James, Nichols, and Phillips 
(J1), Truxal (T2), Newton, Gould, and Kaiser (N2), and Smith (83). 

In Chap. 4 the analysis follows Chang’s generalized version of the s-domain optimi- 
zation method (C5). Most of the subjects are also treated in the book by Newton, 
Gould, and Kaiser (N2) with time-domain analysis. Bendat’s book (B6) has a sec- 
tion on the simultaneous filtering of two related signals which treats the same problem 
as Sec. 4-9 with a different method. 

The original work on time-varying systems (Chap. 5) includes Booton’s optimiza- 
tion theory for time-varying linear systems (B14), Zadeh’s work on variable networks 
(Z1), Zadeh and Ragazzini’s extension of Wiener’s theory to systems with mixed 
inputs and finite observation time (Z2), and Davis’s work (D3) using orthogonal func- 
tions. Books by Laning and Battin (L2), Davenport and Root (D2), Bendat (B6), 
and Seifert and Steeg (S1) also have chapters on the subject. References for further 
studies on the subject are papers by Bendat (B5), Miller and Zadeh (M2), and the 
book by Seifert and Steeg (S1). 

The essential background material for Chap. 6 is the z-transform method which 
can be found in any of the following textbooks: Ragazzini and Franklin (R1), Jury 
(J2), and Tou (T1). The optimization technique follows essentially Chang’s three 
papers (C8, C9, and Optimum Transmission of Continuous Signal over a Sampled 
Data Link, 7'rans. AIFE). For further reference, Franklin (F3) had worked out 
earlier a time-domain method, and Kalman gives an interesting extension of the opti- 
mization technique (K6). 

The original work on the subject matter of Chap. 7, power spectra and correlation- 
function measurements, is from papers by Rice (R5), Y. W. Lee (L3), Tukey (T6), 
Press and Tukey (P2), Davenport (D1), and Chang (C3). The book by Blackman 
and Tukey (B10) is perhaps a classic on the analytical aspect of the subject. 

The references for Chap. 8, error analysis, are papers by Huang (H4), Miller and 
Murray (M3), Papoulis (P1), and Truxal and Horowitz (78). 

In Chap. 9, references for Pontryagin’s maximum principle are papers by 
Boltyanskii, Gamkrelidge, and Pontryagin (B13), Gamkrelidze (G1), and Zadeh (ZA). 
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References for “bang-bang” servo are books by Flugge-Lotz (F2), Truxal (T2), and 
Smith ($3) and papers by McDonald (M1), Hopkin (H2), Bogner and Kazda (B12), 
Rose (R7), Doll and Stout (D5), Chang (C6), and Hung and Chang (Hd): General 
references are books by Ku (K10), Cunningham (C17), Cosgriff (C15), and Gille, 
Pélegrin, and Decaulne (G5). 

De for Chap. 10, self-optimizing systems, are papers by Anderson et al. (A1), 
Draper and Li (D7), Lang and Ham (L1), Osder (01), Kalman (K2), Burt (B20), 
Cosgriff and Emerling (C14), Drenick and Shahbender (D8), Genthe (G3), and 
Reswick (R4). at : 

References for Chap. 11, random processes and optimization of adaptive control, 
are papers by Robbins and Munro (R6), Keifer and Wolfowitz (K7), Bertram (B9), 
Chang (C10), Goodman and Hillsley (G6), and others. 

outed fox Chap. 12 are the book and papers by Bellman (B2, B3, B4) and various 
research reports which are unpublished or are in the process of being published. 

General references on mathematics are Cramer (C16), Copson (C13), and Gardner 
and Barnes (G2). 


Al. Anderson, G. W., et al.: A Self-adjusting System for Optimum Dynamic P 
formance, IRE Conv. Record, pt. 4, vol. 7, pp. 182-190, 1958. : 

B1. Barker, R. H.: The Pulse Transfer Function and Its Application to Sampli 
Servo Systems, Proc. IEE (London), pt. IV, vol. 99, pp. 302-317, Decemh 
1952. May > ] 

B2. Bellman, R.: “Dynamic Programming,” Princeton University Press, Prince 
N.J., 1957. } j 

B3. Bellman, R.: Some New Techniques in the Dynamic Programming Solution 
Variational Problems, Quart. Appl. Math. vol. 16, no. 3, October, 1958. 

B4. Bellman, R., and R. Kalaba: Dynamic Programming and Adaptive Proe' 
Mathematical Foundation, TRE Trans. on Automatic Control, vol. AC-5, no, 

. 5-10, January, 1960. ; f 

Bb. Seat Jecbs: Exact Integral Equation Solutions and Synthesis for a 
Class of Optimum Time Variable Linear Filters, J RE Trans. on Inform. T 
vol. 3, no. 1, March, 1957. : 
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Wiley & Sons, Inc., New York, 1958. ; } 
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B10. Blackman, R. B., and J. W. Tukey: The Measurement of Power Spectra, 
System Tech. J., vol. 37, January and March, 1958; also Dover Publica 
New York, 1959. whet ] 

Bil. Bode, H. W., and C. E. Shannon: A Simplified Derivation of Linear 1 
Square Smoothing and Prediction Theory, Proc. IEE, vol. 38, pp. 417 
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PROBLEMS 


Chapter 1 


1-1. Show that, for a linear system with arbitrary configuration, the following 
relations always hold: 


£0 ~(x),0[-¥,]-G),oD-70] 


where C = controlled variable, output of plant 

D, = load disturbance at output end if control loop is open 

N2 = instrument noise in measured value of C 

R = reference input 
C/R, C/N», C/D, = closed-loop transfer functions as designated 
(C/R)(s) = required ratio of C(s)/R(s) such that input to plant is zero 
1-2. In the literature, at least three different system configurations have been 

suggested for adaptive control of an aircraft: (1) conditional feedback, (2) prefiltering, 


woul 


(c) Reciprocal model feedback 


Fra, P 1-2 
bb 
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and (3) reciprocal model feedback. In the block diagrams of Fig. P 1-2, N represents 
a nonlinear or variable-gain element, A(s) represents aircraft dynamics, and H,,(s) 
represents the transfer function of the measuring instrument. 
Prove the following: 
(a) The three systems are equivalent to each other if 


i Ma(s)Ha(s) = M)(s)Gs(s) = G.(s) 


Hdd) = Gils) « ia 


and the same N, A(s), and Hm(s) are used in all three cases. 
(b) In case the elements Ha(s), Go(s), and G@.(s) have very high gain, H,,(s) = in 
and the closed loops are stable; then 


4 (s) &M.(s) for (1) 
§ (s) = M(s) for (2) 
4 (s) = M.(s) for (3) 


(c) Is it necessary for the aircraft response to be linear in order to derive the al 
results? 


Chapter 2 
2-1. Show that, for any arbitrary r(¢), if the only design criterion is 


io v(t) — ct)" dt’ = min 


the optimum system function is 
5 () =1 


2-2. Design an optimum controller that meets the following specifications: 

(a) The integral-square error following a step input of one unit is 0.01 sec or low 

(b) The integral-square value of the input to the plant is minimum. 

The transfer function of the controlled plant is G(s) = 10/s. 

2-3. Repeat Prob. 2-2 for a controlled plant with G(s) = 10/s*. 

2-4. Repeat Prob. 2-2 if, instead of condition a, the integral-square error follo 
a ramp input of one unit is 0.01 see or less and G(s) = 10/s*. 

2-5. Certain types of control systems are acceleration-limited. For conven 


we write 
°0 d2¢ 2 
J (i) o a& 


Determine the optimum control ratios which give least-integral-square errors for 
following cases: 


(a) The input is a step function of 0.1 unit. 

(b) The input is a ramp function of 1 unit. : 

(c) The input is a mixture of (a) and (b) with equal probability of oecurrened, 
condition (1) is to be satisfied, on an average. 


357 


2-6. Assume unity feedback in Prob. 2-5, and determine the open-loop transfer 
functions for all three cases. Make Bode plots of the open-loop transfer functions and 
note the positions of the corner frequencies. 

2-7. Plot the root-square locus for a system with plant transfer function 


5(s + 0.2) 
s?(s? + 0.1s + 0.09) 
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G(s) = 


The desired response time of the closed-loop system is 3 sec, and the integral-square 
value of the input to G(s) following a step input is to be minimized. Determine the 
optimum control ratio and discuss the possible ways of realizing this ratio. 

2-8. A positional control system is represented by the block diagram of Fig. P 2-8, 


Amplifier and 
compensating 
network 


Motor and load 


Fig. P 2-8 
where Kr = 2.1 lb-ft /amp 
K. = 0.3 volt/rpm 
R,. = 2.7 ohms 


n = 3.5 lb-ft/100 rpm motor + load 

J = 0.11 slug-ft? motor + load 
All the above constants refer to the motor shaft. On an average, the load is to be 
moved one-quarter turn at the rate of approximately five times every minute, and the 
gear ratio is 200:1. 

(a) Determine K, and H,(s) to give an integral-square error of no more than 16 sec 
subsequent to a unit step input with minimum heat loss to the motor (use Fig. 2-13). 

(6) What is the armature copper loss under the above-mentioned operating 
condition? 

(c) Plot the transient response of the optimum system. 

2-9. If, in Prob. 2-8, no compensating network is to be used, H.(s) = 1. What is 
the value of K, to give the least-integral-square error with step input? 

2-10. One way. of controlling the roll motion of a missile is to use auxiliary jets 
situated at the tips of the fins to supply the necessary roll moment m(t). The valves 
controlling the jets are actuated by gyro signals in any desired manner. In designing 
such a system, it is desirable to achieve the specified speed of response with the least 
expenditure in ejected gas, the rate of flow of which increases with |m(t)|. As a pre- 
liminary step, we may assume that the optimum system is one which meets the 
condition 


J, to - cr + mimo) at = min 


where & is any given constant, r(t) is the ordered roll angle, and e(é) is the actual roll 
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angle. Given 


1 
R(s) = 5 
Ce) 1 
M(s) Js? 


Determine: 
(a) The optimum (C/R)(s). 
(b) The integral-square value of m(t) versus the rise time tp of the closed-loop 

system. 
(c) The integrated absolute value of m(t) with respect to time versus the rise time t,. 
2-11. In Prob. 2-10, if the reference input consists of ramps and steps instead of 

steps alone, the equivalent reference input function F.(s) can be calculated from 

Rs) Res) = — 4 4% 

What is the optimum form of (C/R)(s)? 
Determine the compensating-network function if series compensation is used. 
2-12. The transfer function of an idealized ship-steering system can be written as 


Ce) _K 
A(s) Ms? 


where c(t) is the ship’s heading and a(t) is the rudder angle. 
(a) With unit step input, find the optimum (C/R)(s) such that 


iam tae [r@) — c@)|? dt = min 
= © a |’ 

a J 0 [ a | <% 
(b) Determine Jo, Ln, rise time tp, and peak values of a(t) and da(t)/dt as functions 
of the Lagrange multiplier k. 
(¢) Express peak values of a(t) and da(t) /dt as functions of tp and parameter K/M. 
2-13. The roll motion of an ocean-going ship is stabilized by a pair of activated fing 
located at the two sides of the ship as shown in Fig. P 2-13a. The fins are tilted by 
hydraulic servo to produce a roll moment in proportion to an actuating signal v(/), 


The hydraulic servo is usually fast enough so that, as an approximation, its time eon” 
stants can be neglected. The equation representing the roll motion is given as 


JO) + 9) + Wot) = Ky) 


where 6(é) is the roll angle and K,v(t) is the roll moment generated by the fins, A 
block diagram of the system is shown in Fig. P 2-13b. The values of the constants ine 
given as 


On = v= = 0.5 radian /sec 


is ; 
iw = 0.04 radian /volt 

Design a control system which has a rise time of 2 see and is optimum to step Inpule 
the sense of Sec. 2-6. Determine the values of the constants K, Ky, and Ky 
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Center of force 


Tilt angle Ship 
motion 


9 


Activated fins 
pre ee 
of a fin 
(a) 
: Hydraulic servo 
Amplifier and fins Ship 


1 


(d) 
Fie. P 2-13 


2-14. Suppose that in Prob. 2-13, instead of roll-rate and roll-acceleration signals 
being used to stabilize the system, a series-compensating network @,(s) is to be used. 
Determine the function KG,,(s). 

In practice, the values of the “constants” change with the loading condition of the 
ship. A certain combination of changes gives a variation in w, of +20 per cent from 
its nominal value but no change in fw, and K;/J. As the compensating means 
go, and K;/J remain fixed, and w, varies from 0.4 to 0.6 radian/sec. What are the 
closed-loop root loci for the series-compensated system? For the shunt-compensated 
system of Prob. 2-13? Which system is more desirable? 


Amplifier and 
network 


Hydraulic servo 
and fins Ship 


o(t) 
Js*+s+lW 


Fia. P 2-14 


2-15. Quite frequently, the plant transfer function Gp(s) can be written as 
G,(s) = Gi(s)G2(s) 


where G(s) represents the low-frequency response of the plant and G2(s) represents 
some resonant modes at higher frequencies. The exact form of Gy(s) is unknown 
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except that G2(jw) = 1 for lower frequencies. One way of designing such a system is 
to use the approximation 

Gp(s) = Gi(s) 
and require that 


ay is [» CC(—s?)* 


pe See Oe — 0) | ds = min 


ae 
The above condition has the effect of limiting the bandwidth of the plant input signal 
a(t) so that the high-frequency resonant modes are not excited. 

(a) Show that, with R(s) = 1/s, the gain of the closed-loop transfer function I /R 
decreases at the rate of 6n db/octave for large values of w. 

(6) Show that, with arbitrary R(s), the input power to the plant, |J(jw)|?, decreases 
at least at 6(n + 1) db/octave at large values of w. 

(c) Given Gi(s) = K/s(1 + 8), use the above criterion to determine (C/R)(s) sueh 
that the closed-loop system bandwidth is 10 radians/sec and the rate of decrease of 
|[(jw)|? is 18 db/octave at high frequencies. 

(d) Make a Bode plot of the closed-loop transfer functions C/R and I/R for the 
system you have just derived. 

2-16. Show that, in a type 0 system with unit step input, the steady-state value of 
e? + ki? is a minimum with the d-c loop gain given by Eq. (2-51). 


Chapter 3 


3-1. The block diagram of a certain positioning system is shown in Fig. P $1, 


d(t) 


Fie. P 3-1 


With control loop open, c(¢) responds freely to the load disturbance, and its spectral 
density is measured to be 

12 
(w? + 9)w? 


What is @4a(jw)? With control loop closed, a random noise of spectral densily 
nn(jw) = 0.1 is introduced into the system. Assuming that r(t) = 0, determine the 
new value of #.-(jw). The noise and load disturbance are uncorrelated. 

3-2. If, in Prob. 3-1, the random noise and load disturbance are Gaussian, what 
the distribution function of c(t)? Does the distribution function vary with tl 
Following a step input r(t) = 10u(t), what is the new distribution function of e() 
various ¢? 


Pec(jw) - 
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3-3. An ensemble of random signals r(¢) is symmetrical about zero: 
P(r()] = P[-r@] 
Signals x(¢) and y(¢) are obtained by rectifying and squaring r(¢), respectively: 


z(t) = |r| 
y® =[r@OP 
Show that 
orz(r) = ory (7) =0 


8-4. A random positional variable 6(¢) has a spectral-density function %99(jw) = 
A*/w*, What is the mean-square value of 6(t)? Is this possible physically? Is it 
physically possible to have a spectral-density function 


; A? 
Poa(jw) = = 


8-5. The amplifier gain K in Prob. 3-1 is to be adjusted for the least-mean-square 
value of c(t). Determine K and the least-square value of c(t). 

3-6. The transit time of an electron in a vacuum tube is of the order of 1079 see. 
Show that, in starved cathode operation (with negligible space-charge effect), the 
spectral density of the fluctuating component of plate current is eZ for the lower fre- 
quencies, where e is the electronic charge and J = ve is the average or d-c plate cur- 
rent. At what frequency do you expect the spectral density to taper off? Why? 

3-7. A function r(t) = 10 + f(t), where f() is a Gaussian random fluctuation with 


10 
Bs ;(Jw) = eee et 


Determine the fraction of time that |r(¢)| < 5 and the number of times per second the 
value of r(t) goes under 5. 

3-8. Determine the autocorrelation functions and spectral densities of the following 
signals. 

(a) Asteplike input of mean-square amplitude 1 with probability P of yn repeating 
the sign of yn—1. 

(b) A series of randomly occurring correlated step inputs as shown in Fig. 3-9 with 


YYien = Q" sin ny 


(c) A series of randomly occurring pulses of the form of Eq. (3-31), where the 
a,’s are uncorrelated and f;,(t) = e~ for t > 0, and f,(t) = 0 for t < 0. 

Assume, on an average, that there are » inputs or pulses per second in all three 
cases. 

3-9. Repeat Prob. 3-8 with the inputs occurring regularly at t = n/v, where n is an 
integer. 

8-10. With reference to the block diagram, the spectral densities of i; and 72 are 


1 
Fic 
Py = by = 0 


What are the spectral densities 5), Py, and 9, of the output signals? On the other 
hand, given My), Dy, and M4, can ,,, Dye, and &» be determined? 
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3-11. Determine the cross-correlation functions and cross-spectral densities of the 
following pairs of signals: 


(a) sin w:t and A sin (wit + $) + n(t), where ¢ is a constant and n(t) is uncorré= 
lated noise. 


(b) sin wit and |sin wil]. 
(c) sin w:t and periodic function f(t) = f(t + 2mr/w). 
(d) Zanf(t — tr) and Zang(t — tn + to), where f(t) and g(t) are known functions of 


time, a, is evenly distributed with mean-square value a?, and ¢, occurs at random with 
average frequency ». 


(e) Zanf(t — tr) and Zlan|g(t — tr + to). 


(f) r(t) and r(t)a(t), where r(t) and a(t) are uncorrelated random signals and a(t) 
has zero mean. 


3-12. A missile in horizontal flight can be represented approximately by the blook 
diagram, where 0(¢) is the heading angle and 4(¢) is the angle of deflection of the con. 
trol surface. Suppose that the disturbing torque due to atmospheric turbulence 
can be represented by randomly applied torque impulses of root-mean-square mail: 
tude 1 and zero mean and that there are, on an average, v independent impulses par 


second. The control loop is open at ¢ = 0 (omit the broken lines) [8(t) = 0 fort > 0), 
with correct heading. 


(a) Determine the expected rms heading error at t = ft, sec. 
(b) Under what condition would the heading error approach normal distribution? 


d(t) 


Kia, P 3-12 
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3-13. From Prob. 3-12, it is obvious that the heading error of an unguided missile 
operating within the earth’s atmosphere increases indefinitely with the time of flight. 
Suppose that a gyro is used to sense the heading error and to operate the control sur- 
face proportionately, as illustrated by the broken lines of the block diagram of Fig. 
P 3-12, with G.(s) equal to a constant K. The gyro error can be represented as noise 
with ®n.(jw). Asasimple approximation, we assume that ,,(jw) is a constant within 
the servo bandwidth and that the desired heading 6;(t) does not change with time. 
Let e(t) = 0:(t) — @(t). Determine for large ¢ (stationary situation): 

(a) Spectral density of the heading error. 

(b) Mean-square value of the heading error. 

(c) The optimum value of K to give the least-square value of the heading error. 

Assume for computation purposes the following numerical values: 


Tm = 0.2 
Kn? aa = 5.76®an 


Chapter 4 


4-1. With reference to the block diagram, r(¢) and n(t) are uncorrelated, and 


3,600 
Pr(Je) = “32 + 169) 
Brn (Jw) = i 


Determine H(s) such that [r(t) — c@)]? is minimum. What does it represent 
physically? 


Fia. P 4-1 


4-2. An optimum position-control system in the presence of load disturbance is to 
be designed. With reference to the block diagram, the mean-square value of a(t) is 
limited. The spectral density ,,(jw) of r(t) is 1/w‘, and the spectral density ®aa(jw) 
of wind-induced load torque is A?/(w? + »?). Determine G.(s) and H(s). What 
physical elements do they represent? Neglect n(¢) in the present problem. 


Fia, P 4-2 
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4-3. A platform-stabilization system has essentially the block diagram of Fig. 
Pp 4-2. However, the torque motor usually has sufficient torque a(t) and is not a 
serious limiting factor on the system’s performance. In its place the noise n(t) in 
the error-sensing element is an important factor to be considered. 

Assume 


bi, = 0-002? 
ae ot 0.01 
®,n = const = 107!° 


Determine: 

(a) Optimum form of G-(s). 

(b) Mean-square error. 

4-4. For the idealized missile-control problem of Prob. 3-13 determine the optimum 
form of G(s), assuming no constraint on the control surface movement. Calculate 
the least-square values of the heading error and the control surface displacement 
[5(¢)]?. 

4-5. Repeat Prob. 4-4, assuming that the mean-square value of the control surface 
displacement is limited. Obtain expressions of the optimum form of G.(s) in terms of 
the Lagrange multiplier ?. 

4-6. The roll motion of a ship can be stabilized by a pair of activated fins as shown 
in Fig. P 2-13a. The motions of the fins are opposite to each other so that a net 
torque is exerted on the ship to neutralize the roll moment exerted by the ocean waves, 
The stabilizing moment due to the fins is given as 


m,(t) = 146wKz[(a1 + Bi) — (a2 + B2)] 


where w is the distance between the centers of force on the fins, Kz is the lift coefficient, 
a and a» are tilt angles of the fins, and 6: and #2 represent the instantaneous dined 
tions of flow at the two fins. Besides the stabilizing moment, there is a disturbing roll 
moment ma(t) exerted on the ship by the ocean waves, and our problem is to use my(t) 
to neutralize the effect of ma(t) as much as possible. There are two ways of doing thint 

1. Tilt-angle control. The tilt angles a: and a: of the two fins are varied according. 
to the error signal e(/) and its derivatives, as shown in Fig. P 4-6a. The hydraulia 
servos are assumed to be perfect, and 


2. Lift control. The lift forces on the two fins are measured by measuring the 
deflections of their shafts. Two hydraulic servos move the fins in such a way that 
their lift forces are varied according to e(t) and its derivatives. The part of the ote 
trol system of Fig. P 4-6a enclosed by broken lines is replaced by two identionl 
branches, one of which is illustrated in Fig. P 4-6b. ‘The hydraulic servos are assumed 


to be perfect, and 
I, = -—L, =L 


(Nore: The variations in 6; and #2 are slow in terms of the response time of & 
hydraulic servo. Consequently the fins move with the flow to give the “ordered lift" 
+L.) 

Assume that the following spectral densities are known: 


$,4(jw) = spectral density of disturbing roll moment 
p,6,(jw) = %p,8,(jw) = &pa(jo) = spectral density of variations in flow direction 


f,p,(jw) = Pap,(jo) = Pup, (jo) = O 
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Derive expressions for ©o,9,(jw) and ®zz(jw) in terms of the above spectral densities 
and transfer functions G(s) of the ship and (C/Ma)(s) which gives minimum [6,(é)]? 
while [L(t)]? is limited. Do the above problems for both systems. 


Linear 
controller 


(a) 


Ordered 
L Hydraulic 
valve and 
motor 


(0) 


Measuring device 


Fie. P 4-6 


4-7. For the (C/M.)(s), ®gg(s), and ®za(s) of Prob. 4-6, compare the relative merits 
of “tilt-angle control” and “lift control,’’ as described in Prob. 4-6, in the following 
aspects: 

(a) How well is the roll motion “stabilized” as represented by 0,6,(jw)? 

(b) Stress on the fin shafts, as represented by ®z1(jw). 

(c) System’s sensitivity to change in ship speed. 

Note that G(s) does not change much with speed (see Prob. 2-13) but Kz is approxi- 
mately proportional to the square of the ship speed. 

4-8. Quite frequently, by using physical intuition, a good approximation to the 
optimal system can be obtained without exact knowledge of the spectral densities. To 
illustrate this point, we shall indicate an approximate solution of Prob. 4-6, using fin 
lift control. With reference to the block diagram, it is physically obvious that most 
of the roll motion is induced by disturbing moments near w,, and the essential function 
of the stabilizing fins is to damp out the resonance, Since the spectral components in 
ma(t) at other frequencies are not ne significant, we make the approximation ®yy(ja) 
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@a(jwo) in calculating the optimum form of (C/M.)(s). Show that 
Cc (8)opt = Eid yatehe: ea 
Mao" I (8 + kiwis + w:) 


where w: is determined by the allowed value of [L()]*, and ¢, is given as 


The controller can be realized by the block diagram of Fig. P 4-8b. Determine K 
and K,. 
Find expressions for the mean-square values of 0() and L(t). 


Ordered L 


System Hydraulic 
controller we 


Ordered L 


(b) 
Fic. P 4-8 


4-9. An input signal consists of a desired signal r(¢) with ?,(jw) = A?/(w* + a’), 
and uncorrelated noise n(t) with ®nn(jw) = B®. Determine the optimum filter to give 
r(t + a) with least-mean-square error. 

4-10. Design an optimum predicting differentiator which gives a signal closest 
(d/dt)r(t + 1) in the least-square sense. The input-signal and noise spectral densi 
are given as 


< 10 é 
@,(jo) = Py) Gnn(jo) = 1 
4-11. For the problem of Sec. 4-9, determine the optimum filtering functions 
G(s) and G2(s) to give r(t) and 7(t), respectively, if the known functions are 


(a) rift) = r(t) + mlb) (b) rit) =r) + ni) 
ro(t) = 7) + nell) ro(t) = F(t) 4+ nal) 

E A? ’ 
Prins (Jw) eageTT TO TAT Primi (Jw) = aa 


w? + a? 


Pryny (Jo) = 1 Prsny (Jw), = 1 
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4-12. The method of Sec. 4-9 does not require any prior knowledge of the statistical 
properties of r(é). Sometimes r(t) is random with a known spectral density ®,r(jw), 
and independent filters G(s) and G2(s) are used to give the minimum value of 


Ir(t) —e()}? 
Show that G.(s) and G2(s) satisfy the following equations: 


Gii,i, + G2Pi,i. — Bir = Xi 
GiPigi, + G2Pizig — Bigr = Xe 


where X, and X» are analytic in the LHP including the imaginary axis. 


Chapter 5 


5-1. With reference to the block diagram, ®nn(jw) is a constant v, and c(t) = 0 for 
allt <0. Att =, the switch is closed. Find an expression for hec(t,t’), where both 
it and t’ are positive. Compare the result with that of Prob. 3-12. 


n(t) - K c(t) 
s(1+Ts) 


Fra. P 5-1 


5-2. The switch in Prob. 5-1 is opened after it has been closed for f1 sec. Show 


that for ¢ >t: 
c(t) = Cy + Cre? 


Determine the distribution functions of C, and C2, assuming that n() is Gaussian. 
5-3. With reference to the block diagram, the switch is open for ¢ < 0 and c(t) 


Fia, P 5-3 
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respond to d(t) freely. At ¢ = 0 the switch is closed. Given ®c4(jw) = 1, determine 


®.c(t,t’). 
5-4. The capacitor C in the circuit diagram is a constant Cy for ¢ < 0 but increases 


linearly afterwards: 
C=C4+ Cit 


Show that the impulse-response function is 


Fe (Co + Cits) ROO + Cxt)—OHREYD E> >O 
0 ift<t 


h(t,) = 


Find also the expression of A(t,t:) fort > 0 > th. 


R 


Fie. P 5-4 


5-5. For Prob. 5-4, determine the correlation functions between (a) e) and e) and 
(b) eo and e; if e; is a white Gaussian noise with &,¢,(é,t’) = 6(t — 0’). 
5-6. Repeat Prob. 5-5 with white Gaussian noise of increasing intensity. 


2 0 fort <0 
deeltt) = (Cor + Cat —t) fort >0 
5-7. Asignal r(t) = A sin (wot + ¢) with unknown A and ¢ is masked by Gaussian 
noise with spectral density ®n,(jw). The desired optimum filter should give r(é) 
unerringly if the noise power is negligible, and least-square error in general. The 
observation time is 7’ sec. Determine g(r) of the optimum filter for the following 
cases: 


5 B 
(a) Pin(jw) =B (b) Brn (Jo) mes wo? +b? 


Determine also the mean-square error for case a. 
5-8. Repeat Prob. 5-7 with 
r(t) _ Ao + Ait 
Pan(jw) = w? 


The desired output is r(f + a). Both Ao and A; are unknown. ! 
5-9. Show that the necessary and sufficient condition for a set of normalized eigen: 
functions U,(t) to be complete is 


x Un(t)Unlr) = 8@ — 1) (1) 


for all values of ¢ and 7 in the interval ab. 
From Iq. (5-99) and (1) above show that 


Kit’) = YE UnWU lt) 
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Given K(t,t’), is the set of eigenfunctions U,(t) uniquely defined? Given the set of 
eigenfunctions U,,(t), is the kernel K(é,t’) uniquely defined? 

5-10. A random signal r(¢) can be expanded into a set of orthogonal functions U,,(t) 
in the interval a <t <b: 


r(t) = = CU p(t) 


where the coefficients C;, are random. By some general argument, using the vector- 
space concept, it can be shown that there exists a set of U,,(t) such that the coefficients 
C, are uncorrelated with each other (Karhunen-Loeve theorem): 


CrCn = 0 
for all pairs n, m with n ~ m. Our problem is to find this particular set of U,(t). 


Show that cm 
Grills’) = YY CAV aU at’) 


n 


and U,(t) is an eigenfunction of the integral equation with orr(t,t’) as kernel, 


Chapter 6 


6-1. With reference to Fig. 6-1, the known transfer functions are 


ee 

s2(1 + s) 

1 Poe e7Ts 
8 


G(s) = 


Gi(s) 


and 7 = 1. Determine K(z) and D(z) so that the following conditions are satisfied 
after a minimum number of sampling instants: 

(a) Errorless at sampling instants only for a unit step input. 

(b) Errorless at all times for a unit step input. 

(c) Errorless at sampling instants only for a unit ramp input. 

(d) Errorless at all times for a unit ramp input. 

Calculate also e:(t) and e3(t) for all four cases. 

6-2. In Prob. 6-1, the D(z) of the system is designed for condition d. Determine 
ei(t) and e3(t) when a unit step input is applied. Also calculate ei(t) and e3(t) of a 
system designed for condition b but with a unit ramp input applied. 

6-3. Repeat Prob. 6-1b but allow two additional sampling intervals so that the 
magnitude of e3(t) can be reduced. 

6-4. Repeat Prob. 6-16 for a sampling interval T = 2. 

6-5. Repeat Prob. 6-15 and d for a plant with transportation lag: 


5e71-58 
#0 Fa) 


1 Ii: 4 i} 
2a la = a | " Sneida ” [ a = a | 


6-7. Given the functions 


G,(s) = 
6-6. Show that 


Ge) = Ris) = : 


1 
a(1 + Ts) 


‘ 1 
D(jw) = rie wif 
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calculate 
(a) G(2)G(e) (6) R(z)R(z) 
() GG@) (d) RR@) 
(e) GR@) (f) GR) 
(g) &(2) (h) G&(z) 


6-8. With reference to Fig. 6-1, the transfer functions G,(s), Gp(s) are known and 
D(z) is to be designed to give the least-integral-square value of e:(t) following a unit 
step input in r(t). There is a constraint on e3(t) of the form 


Y teary < a? 


n=0 


Determine D(z) for the following cases: 


(a) Gi(e) = 2 ) G(s) = 
G(s) = = G0) = 4 


6-9. The following are expressions for sampled spectral density 2(z): 
(a) 1 (b) 2 ear ae 
“T+ 0.52-)(1 + 0.52) (2 +0.5)(2 — 2) 


Determine ¢2(n7') for each case. 
6-10. With reference to Fig. 6-10, the known functions are 


lis) = pg 
F(s) =1 n(t) =0 


and G(s) is to be selected such that [r(t + a) — c(é)]? is a minimum. Determing 
G(s). 


Chapter 7 


7-1. The power spectrum of a Gaussian random signal of 10-cps bandwidth is (@ 
be determined from N independent sample records of 5 min each. The desired fre 
quency resolution is 0.2 eps, and the rms error is to be no more than 5 per cent. 
is the minimum number of N? 

7-2. Consider two signals 

u(t) = i) + w(t) 
v(t) = u® 


where 7;(t) and z(t) are independent and Gaussian. The cospectrum of u(() and 
v(t) is measured as shown in Fig. 7-5a. Find an expression for the rms error of the 
cospectrum. 

7-3. Suppose that the bandpass filter of Fig. 7-4 is not the ideal one shown but haw 
the following characteristic: 
1 


|H (jon) |? = i+ On)! 


where w, = o» + nA. The constants w and A are the center frequency and spaciiy 
the spectral components, respectively. Determine the rms error of the mens 
spectrum, 


PROBLEMS Ab 


7-4. In Prob. 7-1, what is the required N if there is to be 90 per cent confidence 
that @(jw) is within +5 per cent of the measured value? 

7-5. In Prob. 7-1, what is the required N if there is to be 90 per cent confidence 
that every point of the 50-point spectrum is within 5 per cent of its measured value? 


Chapter 8 


8-1. Determine the transient-response sensitivity function with respect to a, 
where a is a negligibly small constant. Step inputs are assumed. 


2 a 
© FO) = Fa +20 +8) FO) "TF @+ ale + ast 
a 82(1 + as) _ _ G(s) 
OrO-Traaraeta FOL aG) 
1 + 28 
Ge) = s? + as 


8-2. From the following system equations, determine for each system whether it has 
no parasitic oscillation, transient parasitic oscillation, or sustained parasitic oscilla- 
tion, where a is a small positive constant approaching zero. 

(a) D(s) = as? + 38 + (2 — a?) 

(b) D(s) = a*s4 + 2a2s3 + (1 + a)s? +58 +4 

(c) D(s) = —as? + s? + 0.2s +1 

8-3. Find the approximate solutions of the following differential equations to the 
first order of a. That is, 


x(ta) = x(t,0) + 
(a) €+ (1 — axv)t+2 = ult) 


02 (t,a) 
da a=0 se 


(b) a@ + 4% +23 = d(t) 


where u(t) and 6(t) are the unit step and unit impulse, respectively. 
8-4. With reference to Fig. 8-1, the transfer functions are given as 


H(s) = Kis 


RK, 
Gs) = Sq 4 Ts) 


The nominal values of K,, Kz, and T are K, = 1, K, = 2, and T = 0.25. Determine 
the locations of the closed-loop poles and the sensitivities of these poles with respect to 
variations of K,, Ke, and T. 

8-5. Determine the nominal value of the damping factor ¢ of the control poles (the 


pair of complex conjugate poles nearest the origin). Assume equal tolerances for 
Ky, Ke, and T: 


What is the largest value of » which still satisfies that ¢ > 0.3 for any combination of 
errors in Ky, Ky, and 7? What is the worst combination in Ky, Ky, and 7 (giving 
lowest ()? Check your result by either root-locus plot or direct factoring. 
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8-6. Repeat Prob. 8-4 with the following nominal values of Ki, K2, and T': 


Kk, = 0.4 
Kz = 20 
T = 0.04 


8-7. Show that, at a multiple pole of the control ratio (C/R)(s), the pole sensi- — 
tivity is infinity. Does that mean that the system is unstable to small component 
variations? ‘ : ‘ 

8-8. Investigate the pole sensitivities of a system with a block diagram as shown in 
Fig. 8-1, H(s) = 0, and 

K(s + a) 
8?(s? + 2fwos + wo?) 


The nominal values of the constants are 


K =15 


w =4 


G(s) = 


a=0.6 
¢ = 0.875 


Plot the root locus of the system with nominal values of wo, a, ¢, and changing value 
of K. ; 

8-9. With reference to Fig. 8-1, the transfer function of a lead-compensated system 
is given as 


1+05s K 
@¢) = 7 00nes + FO) 
H(s) =0 


The nominal values of K and T are K = 14 and T = 0.5 sec. Determine the loohe 
tions of closed-loop poles, peak response, and maximum modulus (for nominal values 
of K and 1’) and their sensitivities with respect to small variations in K and 7’. 

If K and T are to have the same percentage tolerances and the maximum modulus i# 
not to exceed 1.4 for the worst combination of errors in K and 7, what percentage 
tolerance can be allowed? Check the last result by making a complex-plane plot, 
and determine the maximum modulus from the plot or by direct calculation. 

8-10. In Example 2-2, the time constants of the actuating solenoids were not taken 
into consideration. These can be represented as 


1 


(8) = TF 0.1 6/en) + 0.008(6/a1)* 


Consider G(s) as part. of G.(s) and calculate the increase as well as percentage incronae 
in the cost function. ‘ 
8-11. In an idealized steering system, the transfer function from rudder rate y (0 
ship heading y is 
T(s) 
Y(s) 


In a problem with the rms value of the rudder rate y(t) limited, the cost function i# 
assumed to be of the form 


fi cond @ ate ky? 
The spectral density of the desired heading is 


. A 
(Jw) = w? 
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Show that the optimum control ratio is 


wo? 


c (s) = 
R = (s + wo) (s? + 2faos + 0) 


and ¢ = 0.5. 

If the specified limit on peak overshoot is 3 per cent, what value of ¢ should be used? 
What is the percentage increase in the cost function with the new cr 
If the specified limit on peak overshoot is 20 per cent, what value of ¢ should be 
used? 

8-12. For the deadbeat system of Example 6-1 calculate the sensitivity of the sys- 
tem’s response to ramp inputs with respect to the gain of G(s). 


Chapter 9 
9-1. A nonlinear system is represented by 
1 = 222 
do = —3x2(x1? + 22%) + y ly| <t 


For a minimal-time problem, determine the equations to be satisfied by A. (¢) and 
Az(t) of Eq. (9-15). 


9-2. For Prob. 9-1, if, instead of the minimal-time requirement, the following inte- 
gral is to be minimized, 


b 
J =f. (y? + a1? + 22%) dt 


where a, b are fixed terminal points in the phase plane 21, x2, rewrite the system equa- 
tions with a new independent variable 7 so that the problem is to arrive at b with 
minimum 7. Determine the equations to be satisfied by \;(7) and Aa(r). 

9-8. A control system is represented by 

2, 
Tie tt aT Ws Tm 

Determine the optimum switching boundary in the phase plane of the system error 
e(t), subsequent to a step input. Assume that J /Tm = 2 sec?/radian and [Tm = 
0.5 sec/radian. Is it possible to normalize the equation into the form of Eq. (9-29) 
with 7 = 1? 

9-4. In Prob. 9-3, the constraint on f(d is changed to 


—0.5Tm < ft) < 27 n 


Does the optimum form of f(é) still take only the limit values? If so, why? Deter- 
mine the optimum switching boundary in the phase plane of the system error e(t). 
9-5. Convert the following transfer functions to state variable representations: 
10 25(s + 2) 
G SS ——  ___ Ee 
(a) Gs) 8(s? + 2s + 2) (6) Gs) s3 + 4s? + 10s + 2 
_ —8? + 20s + 10 
(c) G(s) = wee 


9-6. To reduce noise in measurement, the error e(t) is prefiltered before its applica- 


tion to the phase-plane controller, as shown in Fig. 9-15a. The filter function is 


10,000 


A) = Tide + 10,000 
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The controlled system is the same as given in Prob. 9-3. Plot on the phase plane the 

required displacement of the optimum switching boundary to compensate filter delay. 
Chapter 10 


10-1. Show that the result of Example 10-1 can be generalized as follows: If, 
instead of the original functions, 


K 
G(s) = wae 
A? 
®,, = wo +a? 


the optimum system can still be represented by Fig. 10-2 with K/s changed to 
K/(s + a) and 
1 A? 
Ke= x # +at—a) 

10-2. Show that the actuating error of Prob. 10-1 (input to Ka) has white spectral 
density. How does the spectral density change with an increase in Ka? Find a 
method of adjustment that converts the system into a closed-loop adaptive system. 

10-3. Discuss the possible ways of implementing adaptive control for the roll-stabili- 
zation system of Prob. 4-8, assuming that only $aa(jon) varies from time to time, 
Investigate the stability of the adaptive loop in case a closed adaptive loop is used, 

10-4. In the problem of Example 10-4 find two linear combinations of the phase 
error and gain error signals that can be used to correct £ and w, independently, at 
least for small deviations from the nominal operating point. ; 

10-5. A certain controlled plant has a transfer function G(s) which drifts slowly 
but erratically with time: x 


Gols) = SF Fis) + To)? 


The ranges of value of 7;, T2, and K, are 


0.5<7T1 <3 
0.01 < T. < 0.05 
2<K<4 


The units are in seconds and second—, respectively. The desired control ratio is 


(z), oe 1 a 


In order to accomplish the above, an arrangement similar to Fig. 10-9 is used: 


a(t) = c(t) 
M’(s) = G(s) = 1 
H(s) =1+8 


and K of the variable-gain amplifier is automatically adjusted. As an approximation, 
we may assume that the phase margin of the closed-loop system is 25", Make Bode 


ip. Ae eee the range 0.1 <w < 2 for two extreme combinations 
plots of the ratio (C/R)a(ju) in g <w< 


of T;, 7s, and Kg which give a ratio closest to 1 and a ratio farthest from 1, 
,’ ? 
respectively. 
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10-6. With reference to Fig. 10-2, the transfer function of the plant is (instead of 
K/s) 
G(s) = tent 
Js? + ns 
The signal r(¢) is a ramp input Bt and the noise n(t) has a power spectrum ®nn(jw) = 
a, The adaptive controller adjusts K, so that the total mean-square error under 
steady-state operation is a minimum. Determine Ka as a function of B and o?. 
Work out three ways of adjusting Ka: 

(a) Open adaptive loop. 

(6) Closed adaptive loop. 

(c) Draper and Li’s optimalizing control. 

Discuss their relative merits. 


Chapter 11 


11-1. The power spectrum of a random signal is known except for a constant: 


: K 
P(jw) = ea 
Calculate the required interval of measurement 7' so that the expected rms error in 
measured value of K is 1 per cent. Suppose that the functional form of 7 is unknown 
and that the frequency resolution is Aw = 0.2 radian/sec. What is the required 7 
for a 1 per cent rms error? 

11-2. With reference to Fig. 11-1, h(r) and H( jw) are to be determined by methods 
1 and 3, correlation measurement and cross-spectrum measurement, respectively, 
described in Sec. 11-4. Suppose that ;; = 10, Pan = 1, H(s) = 1/(1 + 8), Aw = 
0.1 radian /sec, and the measuring interval 7 is 1 min. Determine the expected mean- 
square errors for h(r) and H(jw). Compare the two methods. 

11-3. In Example 11-1, the variation of the situation variable is assumed to be 
steplike. For ramplike variations of the situation variable, 


: 20C 
Prgrg(Jw) = OF atal 
and $55(z) = A 


determine the optimum weighting factors w; and minimum value of D. 
11-4. Obtain a solution for the derivative-sensing systems similar to that of Exam- 
ple 11-2 but with the more general form of 22, (2) as given by Eq. (11-54). 


11-5. Repeat Prob. 11-4 for an alternative-biasing system. 


INDEX 


Adaptive control systems (see 
Self-optimizing systems) 
Aircraft pitch control, 88-90 
Amplitude constraints, power 
constraints, 67-68, 118-119 
control of system bandwidth, 68 
heat loss, 67 
saturation, 67, 217 
Anderson, G. W., 263 


Bang-bang control, 9 
(See also Minimal-time control; Pre- 
dictor control) 
Bellman, R., 313 
Bendat, J. S., 347 
Bergen, A. R., 130 
Bertram, J. E., 180, 292 
Bogner, I., 225 
Boltyanskii, V. G., 217 
Booton, R. C., Jr., 105 
Butterworth function, 31 


Central limit theorem, 328 
Chadwick, J. H., 170 
Chang, S. 8S. L., 347, 348 
Characteristic function, 330 
Computer control, 309-311 
digital feedback system, 311 
input control, 310 
programming and perturbation 
correction, 310-311 
Conditions of physical realizability, 
continuous system, 14, 65-66 
sampled-data system, 129-130 
Confidence interval, 836-337 


Control effort, minimization of, 12 
Control ratio, optimum, for random in- 
puts, 70, 74 
for sampled-data systems, 139, 
152, 158 | 
for step and ramp inputs, 20, 21 
poles of, 20 
Correlation function, auto-, cross-, 41, 
102-103 
basic properties, 41 
definition, 41 
calculation of, 51-59 
measured, 164, 342-344 
mean-square error, in continuous 
measurement, 342-343 
in sampled measurement, 
343-344 
scattered nature, 164 
measurement of, 175-180 
by analog method, continuous cor- 
relator, 178-180 
pulse-type correlator, 180 
by digital method, 175-178 
choice, of maximum correlation 
time, 176 
of sample length, 176 
of sampling period, 175 
Cosgriff, R. L., 348 
Cost function, 36, 39, 207-215 
of a nearly optimum system, 209-215 
continuous, 209-213 
sampled-data, 214-215 
physical significance, 207-209 


Delta function, 340-341 
Digitized maximum principle, 317-324 
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Digitized maximum principle, com- 
putation procedure, 320-324 
Digitized system equations, general, 
313 
Distribution, continuous, 325-326 
Gaussian, 37 
normal, 38, 328-329 
multivariate, 333 
Poisson, 57 
Draper, C.8., 273-301 
Dynamic programming, 313-317 
application to minimum-cost 
problems, 315-317 
invariance imbedding, 315 
principle of optimality, 314 


Eigenfunctions, 122-123 
completeness, 123 
Higenvalue, 122 
degenerate and nondegenerate, 123 
Ensemble, average, 49 
probabilistic, 50 
Ergodic hypothesis, 40 
Error analysis, 192-208, 337 
Errorless and rippleless systems (see 
Sampled-data systems) 
Expected value, 325, 326 


Finite observation time, optimization 
of systems with, 105-122 
and power constraint, 118-122 
time invariant, 107-118 
Flugge-Lotz, I., 348 
Franklin, G., 347 
Fredholms’ equation, of first kind, 105 
of second kind, 122 
solution for special case, 112-116 
Functional constraints, continuous sys- 
tems, 65-66 
sampled-data systems, 129-130 


Ham, J. M., 270 
Hold systems, zeroth and first-order, 
128 


Hopkin, A. M., 225 
Horowitz, I. M., 347 


Identification problem, 283-289 
approximate stationarity, 283 
determination of impulse response or 

transfer function, by measuring 
spectrum and cross spectrum, 
286-289 

by use, of sinusoidal test signal, 

285-286 
of white noise as test signal, 285 
use of prior knowledge in determina- 

tion, of power spectrum, 284 

of transfer function, 288-289 

Integral-square error, 11 
minimization of, 11-17, 184-139 


Kalman, R. E., 347 
Kazda, L. F., 225 
Keifer, J., 289 

Ku, Y. H., 348 


Lang, G., 270 

Law of large numbers, 326-327 

Lee, Y. W., 263, 347 

Li, Y. T., 273, 301 

Likelihood of saturation, minimizing, 
22 

Linearly compensated systems, basic 
relations of, 5-7 

Load disturbance, 4, 74-77 


McDonald, D. C., 225 
Maneuvering problem, 216 
Maximum principle, 217-222 
Maximum range problem, 312 
Miller, K.S., 105, 198 
Minimal-time control, 216-254, 312 

of linear system with saturation, 

222-254 
(See also Predictor control) 
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Minimum-cost problem, in fixed time 
interval, 312-313 
between terminal points, 312-313 
Moment-generating function, 329- 
332 
Multiple constraints, systems with, 
82-86 


Newton, G. C., 2, 68, 347 
Noise, 4, 78-82 
optimum filtering of, 78-82 
sinusoidal noise, 78-80 
white Gaussian noise, 80-82 
Nonstationary random process, 40 
Nonstationary systems, classification 
of, 100 
optimization of, 104-105 


Optimalizing control methods, 273- 
280 
continuous test signal, 277-279 
output sampling, 276-277 
peak holding, 279-280 
sensitivity signal input, 274-276 
(See also Self-optimizing systems) 
Optimization theorem, 71-73 
Optimum filtering and prediction, 
92-98 
mean-square error, 94 
of two related signals, 95-98 
of sampled signals (see Sampled-data 
systems) 
Osder, S. S., 271-273 


Parseval’s theorem, 12 
Poisson’s distribution, 57 
Pontryagin, L. §., 217 
Prediction operator, 92 
Predictor control, 222-254 
of high-order systems, 234-253 
modified phase-plane control 
244-248 
multiple limits, 248-253 


, 


Predictor control, of high-order sys- 
tems, phase space, 234, 242- 
243 
switching surfaces, 234-238 
preliminary noise filtering, 243-244 
of second-order systems, 224-234 
physical realization with finite 
gain, 226-230 
switching boundaries with various 
pole locations, 230-234 
switching discontinuities in phase 
space, 238-242 
choice of continuous-phase 
variables, 238-239 
Probability density, 37 
(See also Distribution) 


Ragazzini, J. R., 110-111, 130, 347 
Regression, regression line, 337-339 
Response time, 12, 22 

Roll stabilization, 4 
Root-square-locus method, 23-29 


Sampled-data systems, continuous-sys- 
tem variables in, stationarity, 149 
criteria for optimum design, 126- 
127 
deadbeat, 126, 130-134 
prototype design, 131, 132 
modified, 133 
with zero error at all times, 
131-133 
with zero sampling error, 131 
with least-integral-square error, 
134-139 
with presampling filter and wave- 
shaping filter, 153-163 
with random inputs, 149-153 
sampled correlation functions and 
spectral densities, 140-149 
equivalent forms of, 140-143 
input-output relationships, 144— 
149 
with unsynchronized deterministic 
inputs, 153 
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Self-optimizing systems, 255 
basic elements, 255 
definition, 255 
extremal sensing, 256, 273-280 
(See also Optimalizing control 
methods; Stochastic approxima- 
tion; Stochastic tracking) 
input sensing, 257-261 
with controlled plant activities, 
258-261 
of signal and noise spectral 
densities, 257-258 
plant sensing, 261-273 
stability analysis of adaptive loop, 
264-270 
systems using model, 270-273 
flight control, 271-273 
by use of test signal, 261-264 
area ratio, 263 
correlation method, 263-264 
frequency response, 262 
impulse response, 262-263 
Sensitivity, of closed-loop poles, 
203-206 
applications, 203-206, 214 
damping, 205-206 
natural frequency, 205-206 
sampled-data systems, 214 
definition, 203 
of maximum modulus, 206—207 
of trarsient response, 193-203 
calculation by transform method, 
193-198 
definition, 193 
of nonlinear systems, 198-203 
parasitic oscillations, 197-198 
peak response, 206-207 
of sampled-data systems, 213-— 
214 
Sensitivity function, 7 
Ship steering, transverse and longitudi- 
nal mass, 90 
Shot effect, generalized, 49 
Smith, O. J. M., 348 
Spectral densities, measured, 
confidence limits, 190 


Spectral densities, measured, scattered 
nature, 164 
statistical and blurring errors, 168, 
173 
uncertainty of frequency and 
time, 165 
Spectral density, cross-spectral 
density, 42 
with independent variable other than 
time, 184-185 
multiple-dimensional, 185-188 
physical significance, 46-47 
properties, addition, 44 
filtering, 44 
symmetry, 43 
transmission, 43 
Spectral-density measurement, compu- 
tation from correlation functions, 
180-183 
direct, by analog method, 168-175 
averaging filter, 173 
co- and quadrature spectrum, 
173-175 
data speedup, 170-171 
duplex measurements, 189 
prewhiting, 189 
Spectral factoring, 16, 137 
Stationarity of continuous-system 
variables in sampled-data systems, 
149 
Stationary, nonstationary random 
process, 40 
Statistical terms, 325-336 
inductive inference, 334-336 
moments, 325, 329-332, 342 
population, 333-334 
sample, 334 
standard deviation, 325-327 
Stochastic approximation, 289-293 
peak seeking, 292-293 
simple interpretation, 289-290 
for specified performance parameter, 
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